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Abstract 


The  class  of  design  synthesis  problems  encompasses  a  wide  spectrum  of  common  encountered  prob¬ 
lems,  including  robot  planning  problems,  synthesis  of  electronic  circuits,  chemical  synthesis,  genetics 
experiment  design,  and  computer  program  synthesis.  This  thesis  is  in  two  main  parts,  both  dealing 
with  design  synthesis.  The  first  part  is  the  Residue  Method,  an  abductive  approach  to  design  synthe¬ 
sis,  and  the  second  is  supersumption,  a  generalization  of  consistency  checking  of  partially  completed 
designs. 

The  Residue  Method  synthesizes  designs  by  reduction  of  the  design  goal  to  another,  primitively 
achievable  goal.  The  reduced  goal  must  be  consistent  with  known  facts  about  the  world,  must  be 
sufficient  to  achieve  the  original  goal,  and  must  be  a  conjunction  of  formulas  from  a  language  of 
primitively  achievable  formulas.  The  Residue  Method  expresses  the  design  goal,  the  final  design,  and 

all  intermediate  designs  as  formulas  of  first-order  logic.  The  usual  approach  in  deductive  synthesis  j 

has  been  to  express  designs  as  a  single  term  of  composed  state  transformation  functions.  Expression 
of  designs  as  a  formula  rather  than  a  term  simplifies  synthesis  of  non-linear  plans,  allows  postponing 
imposition  of  ordering  constraints,  and  allows  one  to  reason  directly  about  the  proposed  design. 

Soundness  and  completeness  results  are  given  for  two  resolution-based  residue  procedures. 

Supersumption  is  an  attempt  to  exploit  the  consistency  requirement  in  order  to  accelerate  syn¬ 
thesize  of  designs.  Not  only  is  consistency  of  a  partially  completed  design  checked,  but  additional 
“ramifications’*  may  be  derived  that  must  be  true  for  the  partial  design  to  remain  consistent.  By 
making  sure  that  the  ramifications  are  not  violated,  one  avoids  searching  parts  of  the  search  space 
that  do  not  contain  legal  designs.  In  addition,  knowing  ramifications  may  make  additional  search 
control  heuristics  directly  applicable.  The  process  of  imposing  additional  constraints  on  a  subgoal  is 
called  “supersumption ."  Two  phenomena  are  described  by  which  supersumption  can  speed  up  the 
search  —  use  of  ramifications  as  better  generators  of  candidates  than  the  unadorned  goal,  and  use 
of  ramifications  as  filters  to  quickly  eliminate  inconsistent  designs.  Two  resolution-based  methods 
for  deriving  ramifications  are  given,  along  with  soundness  and  completeness  results. 
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Chapter  1 


Introduction 


Everyone  designs  who  devises  courses  of  action  aimed  at  changing  existing 
situations  into  preferred  ones.  The  intellectual  activity  that  produces  material 
artifacts  is  no  different  fundamentally  from  the  one  that  prescribes  remedies  Jor 
a  sick  patient  or  the  one  that  devises  a  new  sales  plan  for  a  company  or  a  social 
welfare  policy  for  a  state. 


Herbert  Simon,  The  Science  of  Design  1 


1 . 1  Overview 


Robot  planning,  genetic  synthesis,  chemical  synthesis,  circuit  design,  and  program  synthesis 
are  but  a  few  examples  of  synthesis  or  design  problems.  In  each  of  these  domains  the  design 
process  is  that  of  finding  a  composition  of  known  types  of  components  to  form  a  whole 
meeting  given  specifications.  Almost  from  the  beginning  of  the  study  of  Artificial  Intelligence 
m  t.'ite  19-50’s,  researchers  have  sought  to  automate  the  process  of  design  synthesis.  A  great 
many  systems  have  been  developed,  some  general  and  others  special-purpose,  some  formal 
in  approach  and  others  less  formal. 

This  dissertation  attempts  to  find  a  formal  framework  that  captures  the  notion  of  the 
design  process  as  making  and  using  a  possibly  conflicting  series  of  design  decisions  to  restrict 
the  set  of  candidate  designs.  Three  main  facets  of  the  design  process  are  explored: 

1.  Residue,  a  deductive  framework  for  synthesis  in  which  designs  are  represented  as 
sets  of  formulas. 

'From  Tin  Sciences  of  the  Artificial,  Second  Edition,  The  MIT  Press,  Cambridge.  Massachusetts, 
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CHAPTER  1.  INTRODUCTION 


2.  Supersumption,  a  technique  for  reformulation  of  design  goals  via  added  constraints. 

3.  Finding  Ramifications,  procedures  for  generating  additional  constraints  that  must 
be  satisfied  in  order  for  a  design  to  be  found.  These  constraints  are  found  as  part  of 
checking  the  consistency  of  a  partially  completed  design. 

1.1.1  Residue  —  Representation  of  Designs  as  Formulas 

To  solve  a  design  synthesis  problem  is  to  map  one  specification  into  another.  One  starts 
with  a  specification  G  (the  goal)  of  what  needs  to  be  true  of  the  object  designed.  A  new 
specification  D  (the  design )  is  sought,  such  that  D  not  only  guarantees  that  G  will  be  met, 
but  such  that  it  is  possible  to  implement  D,  and  where  D  is  specified  at  such  a  level  that 
the  intended  implementor  needs  no  further  instruction. 

Chapter  2  presents  such  a  formulation  of  design  synthesis  expressed  in  first-order  predi¬ 
cate  calculus.  It  is  assumed  that  there  is  a  consistent  set  W  of  axioms  describing  the  world 
and  that  the  goal  G  is  expressed  as  a  single  formula.  Furthermore  it  is  assumed  that  there 
is  a  language  A  of  “assumable”  formulas,  that  is,  all  formulas  that  specify  an  instruction 
simple  enough  for  the  implementor  to  carry  out  without  further  instruction.  It  is  assumed 
that  there  is  an  algorithm  that  decides  in  negligible  time  whether  an  arbitrary  formula  is 
in  A  or  not.  For  convenience,  let  us  also  assume  that  a  conjunction  of  assumable  formulas 
is  assumable,  that  is,  if  Av, . . . ,  An  £  A  then  A\  A  . . .  A  An  €  A. 

To  be  a  legal  design  specification  a  set  of  formulas  D  =  {Di, . . . ,  Dm}  must  be  such  that 

1.  W,D  f=  G  (The  design  achieves  the  goal), 

2.  W  U  D  is  satisfiable  (The  design  is  consistent  with  the  world  model),  and 

3.  For  all  G  D,  D,  £  A.  (The  design  is  expressed  in  terms  of  the  design  primitives). 

Any  set  D  with  the  above  properties  is  called  a  residue.2  Sometimes  it  will  be  convenient 
to  view  D  as  a  single  formula  D  =  Dj  A  . . .  A  Dm.  Each  fact  D{  can  be  seen  as  a  commitment 
or  constraint  upon  the  design,  that  is,  a  design  decision  on  the  part  of  the  designer.3 

2One  might  call  this  approach  to  design  an  abductive  approach,  though  not  without  some  confusion.  The 
term  abduction  or  apogage  dates  back  to  Aristotle.  An  abduction  is  “a  syllogism  whose  major  premise  is 
known  to  be  true  but  whose  minor  premise  is  merely  probable.’’  (  The  Encyclopedia  of  Philosophy,  Macmillan 
Publishing  Co.,  Inc.  tc  The  Free  Press,  New  York,  1967,  page  5-57).  See  also  Hempel  [42].  Charniak  and 
McDermott  [16]  use  the  term  in  a  similar  way  in  describing  generation  of  explanations.  In  addition,  Charles 
Sanders  Peirce  (1839-1914)  [36]  used  “abduction”  to  mean  the  “creat  ive  formulation  of  statistical  hypotheses" 
( Encyclopedia  of  Philosophy,  page  4-176). 

3The  idea  of  a  design  being  built  up  as  a  sequence  of  decisions  or  constraints  is  not  a  new  one,  in 
fact,  virtually  any  search  process  can  be  viewed  as  a  sequence  of  implicit  or  explicit,  decisions.  Heuristic 
Dendral  [8],  REF-ARF  [24]  and  Stefik’s  MObGEN  planner  [93,92]  are  important  examples  of  synthesis 
procedures  in  which  the  decisions  are  explicit. 


1.1.  OVERVIEW 
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Representation  of  a  design  as  a  set  of  assumable  facts  is  called  the  Residue  Approach  in  this 
work. 

In  logic,  a  term  denotes  an  individual  of  the  domain  (or  universe  of  discourse ,  as  it 
is  sometimes  called);  a  formula  denotes  a  proposition  about  the  world.  In  the  residue 
approach,  the  domain  consists  of  possible  design  components  and  parameter  values.  Terms 
denote  such  components  or  values,  and  formulas  express  propositions  about  those  terms. 
For  example,  to  design  a  combinational  circuit,  the  domain  might  be  the  possible  NAND- 
gates,  wires,  inputs  of  N  AND-gates,  and  outputs  of  N  AND-gates.  Synthesis  of  RC-circuits 
via  the  residue  approach  might  entail  a  domain  of  wires,  resistors,  capacitors,  resistors, 
capacitances,  and  resistances.  Decisions  about  the  design  would  be  expressed  as  formulas 
denoting  propositions  about  the  wires,  resistors,  resistances,  etc. 


Example  1.1  Consider  a  planning  problem  in  which  the  designer  has  decided 
that  (1)  a  puton(A,f?)  action  will  be  executed  and  (2)  a  puton(B,C)  will  be 
executed.  These  two  decisions  can  be  represented  via  the  two  formulas: 

Execution(puton(  A,  B),  Tj) 

Execution(puton(R,  C ),  T2). 

Puton  is  a  function  mapping  two  blocks  into  an  action.  Execution  is  a  relation 
on  an  action  a  and  a  time  t  and  denotes  the  proposition  that  action  a  will  be 
executed  at  time  t.  Note  that  the  above  two  facts  in  no  way  determine  whether 
time  Ti  is  before  or  after  time  T2. 


The  Single- Term  Approach  Deductive  synthesis  research  (See,  for  example,  Green  [32], 
Waldinger  and  T  ee  [99],  Luckham  and  Nilsson  [54],  Manna  and  Waldinger  [58,59],  and  Wos, 
et  al  [106].)  has  traditionally  represented  designs  as  a  single  term,  that  is,  both  completed 
designs  and  incomplete  designs  are  represented  as  a  composition  of  functions.  For  example, 
a  robot  plan  to  put  block  B  on  block  (7  and  then  put  block  A  on  block  B  might  be 
represented  as 

puton(  A,  B,  puton(5,  C,  5'0)),  (1) 

where  S0  is  the  initial  state  of  the  system,  and  puton  is  a  function  mapping  two  blocks  and 
a  state  to  a  state.  The  above  approach  of  representing  the  design  via  a  single  term  will  be 
called  the  single-term  approach. 

The  single-term  approach  requires  that  all  designs,  both  completed  and  incomplete,  be 
a  part  of  the  domain.  It  is  not  difficult  to  imagine  finding  a  set  of  functions  with  which  to 
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build  terms  expressing  completed  designs,  but  a  set  of  functions  for  expressing  all  desired 
incomplete  designs  is  more  problematic. 


Example  1.2  Consider  the  example  from  (1)  of  the  robot  plan  expressed  as 
puton(A,  B,  puton(S,  C ,  So)).  If,  during  the  design  process,  it  is  determined  that 
the  plan  should  contain  a  puton(A,  B )  action  and  also  a  puton(B,C)  action  with¬ 
out  specifying  their  order,  it  is  not  clear  how  to  express  the  incomplete  plan  as  a 
single  term.  Instead,  one  must  choose  either  the  term  puton(  A,  B,  puton(R,  C,  So)) 
or  the  term  puton(J9,  C,  puton(A,  B,  So)). 

The  difficulty  in  the  above  example  is  that  the  set  of  completed  designs  is  easily  rep¬ 
resented  as  a  single  term,  but  the  set  of  desired  partial  designs  is  much  bigger  and  not  as 
ammenable  to  representation  as  a  single  term.  Note  that  in  order  to  express  the  desired 
information,  the  partial  design  had  to  be  constrained  more  than  necessary. 

Advantages  of  the  Residue  Approach  The  thesis  claims  that  it  is  beneficial  to  rep¬ 
resent  designs,  especially  incomplete  designs,  as  sets  of  formulas  rather  than  single  terms. 
The  following  reasons  will  be  elaborated  in  Chapter  2: 

1.  As  illustrated  in  Examples  1.1  and  1.2,  the  residue  approach  is  more  expressive  than 
the  single-term  approach;  many  design  decisions  can  be  expressed  easily  in  the  residue 
approach,  but  can  only  be  expressed  via  a  stronger  constraint  in  the  single-term 
approach.  In  such  a  case,  the  lack  of  expressiveness  of  the  single-term  approach  can 
result  in  unnecessary  backtracking.4  Attempts  to  avoid  such  difficulties  in  the  single- 
term  approach  are  fraught  with  difficulties. 

It  should  be  noted  that  syntactically,  any  set  of  formulas  can  be  encoded  as  a  compo¬ 
sition  of  functions  —  one  need  only  define  a  new  (n  +  l)-ary  function  for  each  n-ary 
relation  and  connective  in  the  language.  Thus,  to  say  that  single  terms  are  inherently 
less  expressive  than  sets  of  formulas  is  in  some  sense  incorrect.  On  the  other  hand, 
such  an  encoding  of  a  set  of  formulas  begs  the  question.  Instead  of  denoting  a  set  of 
state  transformations  constituting  a  prescription  for  a  design,  such  a  composition  of 
functions  would  denote  any  design  for  which  all  the  encoded  propositions  hold  —  a 
perverse  way  of  taking  the  residue  approach. 


'Similar  ideas  were  expressed  in  Stallman  and  Sussman’s  EL  [90]  and  in  the  least  commitment  cycle  of 
Stefik’s  planning  engine  [93,92]. 
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2.  Just  as  incomplete  designs  are  better  represented  as  sets  of  facts  than  as  single  terms, 
one  might  desire  a  greater  expressness  for  complete  designs  as  well.  In  Example  1.2, 
for  example,  it  might  not  matter  which  action  comes  first,  or  even  whether  they  are 
executed  in  parallel.  As  such,  it  is  usually  desirable  not  to  commit  to  one  ordering  or 
the  other  (as  demanded  by  the  single- term  approach). 

3.  For  reasoning  about  control  strategies,  performing  consistency  checking  or  finding  su¬ 
persumptions  (presented  in  Chapters  3  and  4),  it  is  important  to  be  able  to  reason 
about  the  design.  An  inference  engine  can  reason  directly  from  a  set  of  facts  using 
well-understood  mechanisms  of  logic.  In  contrast,  reasoning  about  a  single  term  is  an 
ad  hoc  process. 

4.  There  are  problems  for  which  the  full  expressiveness  is  not  needed,  that  is,  the  ex¬ 
pressiveness  of  single-term  approach  is  perfectly  adequate.  For  such  cases,  the  residue 
approach  can  be  used  on  these  problems  as  efficiently  as  the  single-term  approach. 

The  above  formulation  of  design  problems  is  not  operational  —  there  must  be  a  proce¬ 
dure  for  finding  residues.  Chapter  2  presents  two  such  procedures  for  generating  residues, 
Resolution  Residue  and  Ordered  Residue.  In  addition,  it  proves  appropriate  completeness 
results  for  the  two  procedures. 

1.1.2  Supersumption  —  Faster  Solution  via  Added  Constraints 

In  solving  design  problems  via  various  residue  procedures,  it  was  noticed  that  the  systems 
were  not  able  to  take  advantage  of  information  gained  during  consistency  checking.  Rather 
than  just  knowing  that  a  design  is  consistent,  it  was  desired  to  know  what  conditions  will 
have  to  hold  if  the  design  is  to  remain  consistent.  Such  a  condition  can  then  be  incorporated 
into  the  goal  to  avoid  needless  search. 

Supersumption  is  a  such  technique  for  incorporating  these  conditions,  reformulating 
goals  in  such  a  way  that  the  cost  of  solving  the  reformulated  goal  can  be  less  than  the  cost 
of  solving  the  original  goal. 


Example  1.3  Suppose  a  personal  computer  owner  has  numerous  250  Kbyte 
floppy  disks,  a  50  Mbyte  hard  disk,  and  the  goal, 


“Find  all  disk  files  larger  than  1  Mbyte.’’ 
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Due  to  its  size,  no  such  file  could  possibly  exist  on  a  floppy  disk,  but  would  have 
to  be  on  the  hard  disk.  The  reformulated  goal, 

“Find  all  hard  disk  files  larger  than  1  Mbyte” 

would  (1)  have  the  same  set  of  solutions  as  the  original  goal  and  (2)  presumably 
be  cheaper  to  solve  than  the  original  goal  in  that  it  avoids  enumerating  the 
floppy  disk  files. 

In  the  above  example,  the  reformulated  goal  has  the  same  set  of  solutions  as  the  orignal, 
but  this  need  not  be  the  case: 


Example  1.4  Suppose  the  computer  owner  of  Example  1.3  is  given  the  goal, 

“Find  a  disk  file  larger  than  200  Kbytes.” 

If  the  owner  knows  that 

“Most  files  larger  than  128  Kbytes  are  on  hard  disk,” 

he  might  reformulate  the  goal  as, 

“Find  a  hard  disk  file  larger  than  200  Kbytes.” 

The  reformulated  goal  may  eliminate  some  solutions  to  the  original  goal,  but  (1) 
some  solution  is  likely  to  be  found,  and  (2)  the  reformulated  goal  is  presumably 
cheaper  to  solve  than  the  original  goal  in  that  it  avoids  enumerating  the  floppy 
disk  files.  The  owner  may  choose  to  remember  the  original  goal  in  case  no 
solution  is  found  for  the  reformulated  goal. 

In  both  of  the  above  examples,  an  original  goal  was  constrained  by  additional  require¬ 
ments.  In  Example  1.3  the  additional  requirement  (that  the  file  be  on  the  hard  disk)  is 
logically  implied  by  the  goal  G,  any  design  decisions  D  (none  in  this  problem),  and  the 
known  facts  W  about  the  world.  The  subclass  of  logically  implied  additional  constraints 
will  be  called  ramifications.  In  contrast,  the  additional  requirement  in  Example  1.4  is  not 
a  logical  implication  of  the  goal  and  the  known  facts  about  the  world;  it  is  likely  to  be 
true  of  any  solution,  but  some  solutions  of  the  original  goal  may  not  be  solutions  of  the 
reformulated  goal.  Such  a.  constraint  will  be  called  a  probable  constraint. 

The  above  examples  can  be  characterized  by  taking  a  goal  G  and  reformulating  the 
goal  to  be  G  A  A.  Chapter  3  develops  the  above  notion  of  supersumption ,  the  conjoining 
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of  additional  constraints  to  an  existing  goal  or  subgoal.  Supersumption  may  be  done  with 
any  additional  constraint,  either  necessary  (that  is,  a  ramification)  or  not,  but  the  thesis 
concentrates  on  ramifications  alone. 

The  above  reformulations  via  supersumption  are  not  sufficient  to  bring  about  a  speedup, 
however.  In  additional  there  must  be  a  strategy  for  taking  advantage  of  the  added  constraint. 

Example  1.5  Suppose  the  goal  specification  of  Example  1.3  is  written  as 

Find  an  x  such  that:  File(z)  A  (Size(x)  >  1Mbyte). 

Addition  of  the  ramification  Hard-Disk-File(x)  produces  the  goal 

Find  an  x  such  that:  File(ar)  A  (Size(x)  >  1Mbyte)  A  Hard-Disk-File(x), 

but  says  nothing  about  how  to  use  Hard-Disk-File(x)  in  speeding  up  the  search. 

A  speedup  is  obtained  only  by  specifying  a  processing  method  such  that  the  x 
such  that  Hard- Disk- Fi!e(x)  are  enumerated  rather  than  the  x  such  that  File(x). 

In  this  thesis,  the  only  strategy  considered  will  be  reordering  the  conjuncts  of  a  con¬ 
junctive  goal,  the  subject,  of  research  by  David  E.  Smith  [86,85]. 

1.1.3  Ramifications  —  Generalizing  Constraint  Propagation 

As  mentioned  in  Section  1.1.2,  an  important  class  of  supersumptions  is  supersumption  with 
ramifications ,  that  is,  with  constraints  logically  implied  by  the  current  goal  G  (as  opposed  to 
the  original  goal),  the  world  model  W,  and  the  design  decisions  D  to  date.  A  ramification  N 
of  a  goal  G  and  design  decisions  D  is  a  formula  such  that  the  goal  has  no  solution  (given  the 
design  decisions  to  date)  for  which  the  ramification  does  not  hold  as  well.  In  other  words, 
if  D  =  Dj  A  . . .  A  D„, ,  then  N  is  a  formula  such  that  W  \=  (DAG)dN. 

Example  1.6  In  Example  1.3,  a  personal  computer  owner  has  numerous  256 
Kbyte  floppy  disks,  a  50  Mbyte  hard  disk,  and  the  goal, 

“Find  all  disk  files  larger  than  1  Mbyte.” 

As  before,  no  such  file  could  possibly  exist  on  a  floppy  disk,  that  is,  the  condition 

“the  files  are  all  on  hard  disk” 


is  a  ramification  of  the  above  goal;  there  is  no  design  for  which  the  ramification 
does  not  hold. 
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Ramifications  and  Consistency  Checking  Ramifications  can  be  generated  as  a  part  of 
checking  consistency  of  a  design,  complete  or  incomplete.  If  a  partial  design  is  inconsistent, 
there  is  no  consistent  complete  design  incorporating  all  the  decisions  of  the  partial  design, 
that  is,  some  design  decision  has  to  be  relaxed.  If  consistency  checking  were  free,  it  would 
always  pay  to  know  whether  the  current  partial  design  is  consistent.  As  it  turns  out, 
complete  consistency  checking  is  in  general  intractable  and  can  only  be  approximated.  If 
a  problem  is  expressed  in  first-order  logic,  then  inconsistent  designs  can  be  proven  to  be 
inconsistent,  but  not  in  a  bounded  amount  of  time.  In  general,  there  is  no  way  to  be  sure 
that  a  design  is  consistent. 

Inconsistency  can  be  proven  by  showing  that  false  logically  follows  from  (1)  D,  the  design 
decisions  to  date,  (2)  G,  the  goal  at  hand,  and  (3)  the  world  model  W.  In  other  words,  if 
false  is  a  ramification  of  the  current  G  and  D,  then  the  current  partial  design  is  inconsistent. 
In  the  process  trying  to  deriving  false,  one  derives  other  ramifications.  In  Chapter  4  it  is 
shown  exactly  what  other  ramifications  will  be  derived  by  various  procedures  for  checking 
consistency. 

By  recording  ramifications,  one  gets  additional  information  in  answering  the  question, 
“Is  the  design  to  date  consistent?”  Instead  of  a  yes/no  answer,  one  receives  an  answer  of 
“No,”  or  an  answer,  “Yes,  the  partial  design  is  consistent,  but  ramification  A  must  hold  for 
all  complete  designs  incorporating  this  partial  design.” 

Ramifications  and  Constraint  Propagation  Finding  ramifications  may  rightfully  be 
viewed  as  a  generalized  form  of  constraint  propagation,  a  way  to  fit  constraint  propagation 
into  a  deductive  approach.  Constraint  propagation  is  usually  thought  of  as  a  specific  infer¬ 
ences  to  be  carried  out  when  certain  triggering  conditions  are  met,  for  example  in  Waltz’s 
line  labelling  program  [103,102].  in  Stallman  and  Sussman’s  EL  [90],  or  in  St^fik’s  con¬ 
straint  posting  [93,92].  In  this  research  ramifications  are  found  via  the  general  mechanism 
of  forward  inference.  Instead  of  making  a  specific  inference,  one  faces  a  search  problem  with 
difficult  questions  of  (1)  how  to  search  and  (2)  what  constitutes  a  useful  ramification. 


1.2  Approach  and  Scope  of  the  Thesis 

1.2.1  Deductive  Synthesis 

The  approach  taken  in  this  thesis  is  that  of  deductive  synthesis.  Deduction  is  defined  as,  “the 
act  or  process  of  reasoning,  especially  a  logical  method  in  which  a  conclusion  necessarily 
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follows  from  the  propositions  stated.”5  Deductive  synthesis  refers  to  the  construction  of  an 
implementable  specification  of  an  object  as  part  of  the  proof  of  a  theorem  that  the  object 
meets  a  certain  (possibly  abstract)  specification.  In  this  thesis,  the  world  model  W,  the 
design  goal  G,  and  the  design  D  itself  are  all  represented  as  formulas  or  sets  of  formulas  in 
first-order  predicate  calculus,  but  they  need  not  be;  there  is  no  reason  not  to  do  deductive 
synthesis  in  some  other  logic  (See,  for  example  Konolige  [46]).  There  are  also  numerous 
alternatives  to  deductive  synthesis  systems  (See,  for  example,  Burstall  [11],  Manna  and 
Waldinger  [60],  Barstow  [2],  and  Green  [33]).  The  fact  that  a  system  is  not  deductive  does 
not  mean  that  its  output  is  more  or  less  believable  than  a  deductive  system’s  output  —  a 
deductive  system’s  deductions  are  only  as  good  as  its  axiomatization  of  the  world. 

Taking  a  deductive  approach  implies  a  declarative  approach  to  knowledge  representation. 
In  declarative  approaches,  there  is  a  generally  strict  separation  of  control  of  search  and 
inference.  Instead  of  expressing  algorithms  procedurally,  one  expresses  a  particular  strategy 
to  control  the  search  through  the  space  of  possible  inferences,  where  the  inference  engine 
is  making  inferences  (sound  or  otherwise)  from  some  body  of  information  about  the  world. 
The  declarative  approach  has  its  roots  in  a  number  of  AI  systems  based  upon  theorem 
proving.  Some  notable  examples  are  QA3  [32],  STRIPS  [25],  and  FOL  [26].  Pat  Hayes’ 
early  papers  [39,38,37]  on  declarativism  were  also  seminal.  A  declarativist  view  is  part 
and  parcel  of  logic  programming,  and  such  a  view  has  been  embodied  in  such  systems  as 
Prolog  [78]  and  MRS  [27,79].  See  also  Kowalski  [48,47]  for  early  expressions  of  declarativism 
as  it  related  to  logic  programming. 

In  design  synthesis,  one  hopes  for  a  number  of  advantages  in  separating  control  knowl¬ 
edge  and  world  knowledge: 

1.  Knowledge  about  design  components  is  expressed  independent  of  its  use.  The  set  of 
facts  about  design  components  need  only  reflect  what  is  true  in  the  world,  and  can  be 
checked  independent  of  the  design  engine. 

2.  A  different  design  engine,  i.e.,  different  control  strategy  or  different  inference  engine, 
might  use  the  same  body  of  knowledge  about  the  particular  design  domain.  Research 
on  control  of  inference  becomes  applicable  to  particular  problems  of  a  declarative  sys¬ 
tem. 

3.  Evolution  of  knowledge  about  the  domain  does  not  require  changes  in  the  design  en¬ 
gine  (although  changes  for  the  sake  of  efficiency  might  be  advisable). 

4.  By  compiling  the  set  of  inferences  made  by  a  particular  design  engine  on  a  particular 
set  of  rules,  one  can  achieve  the  same  speed  as  with  procedural  systems. 

'The  /lmericori  Heritage  Dictionary,  Dell  Publishing  Co.,  Inc.,  New  York,  NY,  Ui8:{. 
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The  above  (or  similar  )  claims  for  declarative  representation  have  often  been  heard  in  the 
declarative/procedural  controversy  that  has  been  raging,  on  and  off,  since  the  early  1970’s. 
After  fifteen  years  of  dispute,  the  final  results  and  outcome  of  the  controversy  are  not  entirely 
clear.  See,  for  example,  VVinograd  [104],  or  the  Handbook  of  Artificial  Intelligence  [1]  for 
discussions  of  the  Declarative/Procedural  Controversy.  See  McDermott  [68]  for  the  current 
view  of  a  discouraged  declarativist. 

1.2.2  Number  of  Solutions  to  a  Problem 

There  are  many  sorts  of  goals6  for  which  a  problem  solver  might  be  asked  to  find  a  solution. 
Examples  might  include: 

•  Does  there  exist  a  file  written  in  the  last  hour? 

•  Find  all  files  written  in  the  last  hour. 

•  Find  any  file  written  in  the  last  hour. 

•  Find  4  files  written  in  the  last  hour. 

•  Find  the  largest  file  written  in  the  last  hour. 

In  the  relational  database  literature,  the  standard  problem  is  to  find  all  tuples  meeting  some 
criterion,  for  example,  all  files  written  in  the  last  hour,  or  all  flights  between  San  Francisco 
and  Denver  leaving  between  1  p.m.  and  4:30  p.m.  on  January  10.  In  contrast,  design 
problems  usually  require  only  one  solution.  We  do  not  care  about  finding  all  circuits  to 
shift  bits  in  a  24  bit  word,  in  fact,  there  is  usually  an  infinite  number  of  solutions  to  such 
problems,  anyway.  Instead,  one  must  find  at  least  one  solution. 

The  residue  approach  is  geared  toward  finding  single  solutions.  By  using  an  agenda 
mechanism  for  its  search,  a  residue  procedure  can  also  find  multiple  solutions.  To  find  the 
next  solution,  the  procedure  can  simply  be  restarted  with  the  agenda  in  the  state  where 
it  left  off.  To  find  all  solutions,  the  residue  procedure  must  be  called  until  its  agenda  is 

6The  word  goal  is  commonly  used  in  at  least  two  ways.  The  goal  might  be  a  desired  final  state,  or  it 
might  be  the  path  by  which  one  arrives  at  that  final  state.  Sometimes  the  final  state  is  described  in  terms 
of  that  path,  as  is  the  case  in  the  single-term  approach.  In  Chapters  2-5,  goals  are  the  former,  that  is,  a 
description  of  the  final  destination  or  state  in  which  one  desires  to  be.  In  this  chapter,  the  word  goal  is  used 
a  bit  more  loosely.  For  example,  in  the  “goal  types”  enumerated  below  in  the  text,  the  notion  of  goal  is 
of  yet  another  variety,  that  is,  an  expression  of  a  task  to  be  carried  out;  it  falls  into  neither  the  first  nor 
the  second  notion  of  goal  described  above.  Since  the  proper  notion  of  goal  is  clear  from  the  context  in  this 
and  other  examples  of  the  present  chapter,  the  different  notions  of  goals  will  not  be  further  distinguished. 
In  future  chapters,  goals  will  be  specified  according  to  the  first  notion,  that  is,  a  description  of  the  desired 
state  for  which  we  inav  find  zero,  one,  or  many  solutions. 
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exhausted  or  it  is  known  that  all  solutions  have  been  found.7  Thus,  for  all  of  the  above  goal 
types  except  the  last,  an  agenda-based  residue  procedure  can  provide  answers  by  simply 
being  called  the  appropriate  number  of  times.  The  last  goal  type  above  is  not  covered  in  this 
work  because  it  requires  an  additional  mechanism  for  looking  over  the  set  of  all  solutions 
generated. 

The  notation  for  queries  used  in  the  rest  of  this  thesis  is  adapted  from  D.  E.  Smith  [85] 
and  is  as  follows:  Queries  are  expressed  in  the  form  “find  n  v:  g,”  where  g  is  a  formula 
containing  zero  or  more  free  variables,  n  is  the  number  of  solutions  desired,  or  “all”  if  all 
solutions  are  desired,  v  is  the  subset  of  the  free  variables  in  g  for  which  values  are  required 
(the  other  free  variables  are  assumed  to  be  existentially  quantified).  If  v  is  the  entire  set  of 
free  variables  in  g,  it  will  be  omitted.  Most  queries  will  be  expressed  simply  as  a  formula  g\ 
this  means  that  it  is  assumed  that  (1)  all  free  variables  are  of  interest,  and  (2)  it  is  irrevelant 
to  the  discussion  whether  one,  some,  or  all  solutions  are  desired. 

Whether  one  needs  all  answers  or  just  a  single  answer  is  important  in  considering  search 
methods.  For  problems  requiring  all  solutions,  the  order  in  which  the  solution  space  is 
searched  is  not  important.  As  long  as  there  is  a  possibility  of  finding  an  answer  in  some 
corner  of  the  space,  that  corner  must  be  searched,  and  it  does  not  matter  whether  it  is 
searched  first  or  last.  On  the  other  hand,  if  one  only  needs  to  find  a  single  answer  to  a 
problem,  it  is  best  to  look  where  an  answer  is  most  likely  to  be  found  quickest.  In  fact, 
to  find  any  proper  subset  of  all  the  solutions,  it  is  best  to  look  first  where  answers  will  be 
found  the  quickest.  To  illustrate  the  above  phenomenon,  consider  the  following  example: 


Example  1.7  Suppose 

G(x)  =  File(ar)  A  Name(x,  chess)  A  Executable(x), 

that,  is,  x  is  an  executable  file  named  chess.  Suppose  also  that  it  is  known 
that  chess  is  a  game,  and  that  most  executable  files  for  games  are  on  directory 
/usr/games.  If  the  goal  is  “Find  all  x :  G(x),”  it  does  not  help  to  know  that 
most  of  the  answers  will  come  from  /usr/games.  On  the  other  hand,  if  the  goal 
is  “Find  1  x:  G(x),”  or  “Find  10  x:  G(x),”  then  it  would  be  smart  to  begin  the 
search  on  /usr/games. 

In  finding  a  single  solution  for  a  conjunctive  goal,  ail  the  solutions  for  a  single  conjunct 
will  be  required  in  the  worst  case.  Even  so,  it  pays  to  find  the  easiest  solutions  first  since 
on  the  average  not  all  the  solutions  will  have  to  be  generated. 

7  Note  also  that  the  order  of  the  search  can  affect  whether  the  search  provides  all  answers  or  whether  it 
loops. 
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1.2.3  The  Qualification  and  Ramification  Problems 

Two  major  problems  in  design  synthesis  are  the  well-known  qualification  problem  (Mc¬ 
Carthy  [65])  and  its  dual  problem,  the  ramification  problem  (discussed  below).  At  the 
outset,  let  us  note  that  these  problems  are  only  indirectly  addressed  by  using  a  declarative 
approach.  It  has  long  been  recognized  that  for  any  real-world  design  component  to  work 
as  expected,  there  are  an  unbounded  number  of  prerequisites  that  must  be  fulfilled.  This 
problem  is  usually  known  as  the  qualification  problem.  The  classic  example  is  the  “potato 
in  the  tailpipe.”  A  rule  might  say,  “If  a  car  has  gas,  turning  the  key  in  the  ignition  will 
cause  the  car  to  start.”  The  above  rule  names  one  prerequisite,  namely,  that  the  car  has 
gasoline.  If,  however,  a  potato  can  be  put  in  the  tailpipe  of  the  car,  the  above  rule  is  no 
longer  correct.8  The  rule  might  be  fixed  to  include  a  “no  potato  in  the  tailpipe”  prerequi¬ 
site,  but  one  can  always  find  another  heretofore  unmentioned  and  obscure  prerequisite  that 
might  be  violated  in  the  real  world. 

A  declarative  approach  to  synthesis  does  not  directly  address  the  qualification  problem. 
It  may  explicitly  name  known  prerequisites  for  a  design  component  to  behave  as  expected, 
but  in  no  way  does  it  tell  one  what  additional  qualifications  to  a  rule  must  be  made.  Thus, 
via  sound  rules  of  inference,  one  can  “prove”  that  a  given  design  will  meet  its  specifications, 
but  the  proof  has  meaning  only  insofar  as  the  descriptions  of  design  components  and  their 
behaviors  actually  describe  the  world. 

A  second  problem  endemic  to  design  synthesis,  whether  declarative  or  otherwise,  is  the 
ramification  problem,  a  dual  problem  to  the  qualification  problem.  In  general,  a  given  design 
not  only  depends  upon  an  unbounded  number  of  prerequisites  (the  qualification  problem), 
but  it  has  an  unbounded  number  of  postrequisites,  that  is,  of  ramifications.  Consider  the  goal 
of  removing  a  single  file  from  a  given  directory.  Removing  all  the  files  from  that  directory 
meets  the  goal  specification,  but  is  probably  unacceptable. 

In  general,  there  is  no  way  for  a  given  goal  specification  to  name  all  the  postrequisites 
that  should  not  be  true  of  a  given  design,  and  a  declarative  and/or  deductive  approach  does 
not  directly  address  this  problem  any  more  than  other  approaches.9 


8One  cannot  help  but  note  that  the  above  “potato  in  the  tailpipe”  scenario  is  wrong,  though  well- 
established  in  the  AI  literature.  The  car  will  start;  it  just  will  not  run  for  very  long  (assuming  yet  other 
conditions  like  “no  holes  in  the  exhaust  system”).  See  Beverly  Hills  Cop  for  demonstration  of  the  more 
realistic  scenario. 

*The  residue  approach  of  Chapter  2  does  provide  a  convenient  hook  for  checking  for  some  undesirable 
postrequisites,  namely,  the  mechanism  of  consistency  checking. 
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1.2.4  Best  Fit  Designs 

This  thesis  considers  the  case  of  designs  that  meet  a  given  goal  specification.  An  interesting 
class  of  problems  are  those  in  which  the  goal  is  overspecified  —  even  though  no  design 
is  expected  to  meet  the  entire  goal  specification,  one  desires  the  design  coming  closest  to 
doing  so  according  to  some  metric.  Examples  of  such  problems  are  studied  in  Barbara  Hayes- 
Roth  and  Frederick  Hayes-Roth’s  errand  planning  work  [41]  and  in  Protean  (Buchanan 
et  al  [10,40]). 

Although  at  first  glance  a  deductive  approach  seems  antithetical  to  “solving”  overspeci¬ 
fied  problems,  it  need  not  be.  Given  a  suitable  metric  for  how  much  of  a  given  goal  has  been 
achieved  by  a  certain  design,  one  could  consider  deductive  synthesis  of  solutions  to  parts 
of  an  overspecified  goal.  Such  problems  are  not  considered  further  in  this  work,  and  to  the 
best  of  the  author’s  knowledge,  deductive  approaches  has  not  been  explored  for  overspecified 
goals. 

1.3  Reader’s  Guide 

Each  of  the  main  ideas  presented  in  this  chapter  is  covered  in  detail  by  a  chapter  of  the 
thesis.  Chapter  2  covers  residues  and  residue  procedures,  Chapter  3  covers  supersumption, 
and  Chapter  4  is  on  finding  ramifications.  Each  is  designed  to  be  as  independent  of  the 
others  as  possible.  Chapter  2  and  Chapter  3  both  stand  as  independent  units,  and  Chapter  4 
has  only  a  slight  amount  of  dependence  on  Chapter  3.  Chapter  5  concludes  with  a  .surninary 
of  results,  limitations  and  future  work. 
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Chapter  2 

Residue 


This  chapter  presents  design  synthesis  as  a  problem  of  finding  residues.  Section  2.1  discusses 
design  synthesis  problems  and  their  scope,  after  which  Section  2.2  is  devoted  to  a  definition 
of  design  synthesis  as  a  problem  in  first-order  logic.  Section  2.3  describes  procedures  for 
generating  residues,  examples  of  which  are  Ordered  Residue,  presented  in  Section  2.4,  and 
Resolution  Residue,  presented  in  Section  2.5.  Ordered  Residue,  based  on  Horn  Clause 
Resolution,  is  the  residue  procedure  used  in  the  rest  of  the  thesis.  It  is  presented  along 
with  a  limited  completeness  result.  Resolution  Residue,  based  on  full  binary  resolution,  is 
presented  with  stronger  completeness  results.  Section  2.6  deals  with  constructing  values  for 
existentially  quantified  variables  in  a  goal  specification.  Section  2.7  compares  and  contrasts 
the  residue  approach  with  the  “single-term  approach”  that  has  been  used  in  most  deductive 
synthesis  systems  to  date.  Section  2.8  compares  and  contrasts  the  residue  approach  to  a 
number  of  other  recent  systems,  and  Section  2.9  presents  conclusions  of  this  chapter. 

2.1  Design  Synthesis 

There  exists  an  almost  unlimited  variety  of  synthesis  or  design  problems.  Circuit  design, 
program  design,  robot  planning,  building  design,  chemical  synthesis,  and  genetic  synthe¬ 
sis  are  a  tiny  subset  of  the  multitude  of  problems  in  which  primitive  building  blocks  are 
composed  in  such  a  way  that  the  result  meets  a  set  of  output  specifications. 

The  terms  synthesis  problem,  design  problem  and  design  synthesis  problem  have  been 
used  in  numerous  contexts.  The  present  work  makes  no  distiction  among  them  (and  uses 
them  interchangably),  but  it  assume.^  the  following  general  scenario: 


There  are  two  agents,  a.  designer  and  an  implementor ,  not  necessarily  dis¬ 
tinct.  The  designer  is  given  a  specification  ( t  lie  goal)  of  the  requirements  of  the 
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object  (material  or  otherwise)  the  implementor  will  implement.1  In  addition, 
the  designer  has  information  about  components  at  the  implementor’s  disposal, 
and  what  the  implementor  is  and  is  not  capable  of  carrying  out.  The  objective 
of  the  designer  is  to  map  the  original  specification  to  another  specification  (the 
design)  such  that  the  new  specification  (1)  describes  an  implementation  meet¬ 
ing  the  original  specification,  and  (2)  describes  the  implementation  process  in 
sufficient  detail  that  the  implementor  is  capable  of  carrying  it  out.  2 

As  emphasized  in  the  above  paragraph,  design  synthesis  is  a  mapping  of  one  set  of 
specifications  to  another;  the  design  itself  is  merely  another  set  of  specifications.  A  useful 
design  will  usually  be  a  set  of  specifications  at  less  abstract  level  than  the  specifications  input 
to  the  design  engine. 

In  any  particular  case,  the  criterion  for  what  constitutes  an  adequate  design  specification 
is  a  somewhat  arbitrary.  Usually,  a  given  domain  will  have  a  language  and  set  of  conventions 
specifying  exactly  what  constitutes  an  adequate  design.  In  building  design,  for  example, 
there  is  a  standard  set  of  drawings  that  must  be  submitted.  These  drawings  do  not  specify 
the  construction  or  construction  procedure  to  any  ultimate  level  of  detail.  Instead,  there  is 
a  set  of  conventions  as  to  the  necessary  level  of  detail.  The  architect  does  have  to  specify 
the  dimensions  and  building  materials  for  the  walls  of  a  house.  In  general,  he  does  not 
specify  what  size  nails  to  use  or  the  order  in  which  the  nails  are  driven,  although  these  facts 
could  in  theory  be  part  of  the  design.  In  short,  there  is  a  certain  threshold  of  detail  that  is 
agreed  upon  by  the  designer  and  implementor  as  being  primitively  achievable  or  assumable 
—  any  such  specification  need  not  be  further  elaborated.  An  adequate  design  (or  simply 
“a  design”)  can  be  thought  of  in  these  terms  as  consistent  set  of  primitively  achievable 
specifications. 


2.2  Design  by  Finding  Residues 

Design  synthesis  lias  long  been  approached  as  a  problem  of  extracting  a  design  from  a  proof. 
W.  S.  Cooper’s  [18]  1964  system,  and  James  Slagle’s  1965  system  DEDUCOM  [84]  were 
perhaps  the  first  steps  in  that  direction.  In  1969,  Cordell  Green's  QA3  [32]  and  Waldinger 

‘To  avoid  confusion,  the  the  object  to  be  designed  and  the  design  specification  itself  need  to  be  distin¬ 
guished.  The  use  of  the  expression  “speeding  up  the  design,”  is  also  avoided  as  it  is  not  clear  whether  the 
design  process  is  accelerated  or  whether  the  the  object  created  by  the  design  will  run  faster  in  some  sense. 

2Another  possibility  is  that  the  design  output  will  be  used  as  the  specification  for  some  other  design 
problem.  In  VLSI  design,  for  example,  one  might  synthesize  a  slicks  level  design  that  is  used  as  the  input  to 
another  design  problem,  namely,  layout.  Having  specified  the  level  of  the  second  design,  the  design  problem 
remains  the  same,  however. 
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Figure  1:  Design  Synthesis:  A  Mapping  from  One  Set  of  Specifications  to  Another 

and  Lee’s  PROW  [99,98]  synthesis  systems  were  both  published.  QA3  and  PROW  were 
both  resolution-based  theorem-proving  systems,  formulating  similar  theorems  to  be  proven, 
but  quite  different  in  their  answer  extraction  techniques.  The  approach  developed  in  these 
early  systems  (called  the  single-term  approach  in  this  thesis)  has  continued  to  be  used  to 
the  present.  In  constrast,  this  section  presents  an  alternative  approach  to  deductive  design 
called  the  residue  approach?  As  explained  in  Chapter  1,  the  residue  approach  starts  with 
a  goal  G,  world  model  W,  and  a  language  A  of  assumable  specifications,  and  from  these  a 
design  s  lljon  D  must  be  found.  The  approach  is  illustrated  in  Figure  1  and  its  main 

compon  described  in  the  following  subsections. 

2.2.1  The  Goal  G 

The  object  to  be  designed  is  described  by  a  closed  formula,  G,  called  the  goal  or  design  goal. 
For  now,  it  will  be  assumed  that  the  values  for  any  existentially  qualified  variables  are  not 
of  interest.  Later,  in  Section  2.6  it  will  be  shown  how  values  for  existential  variables  can  be 
extracted  from  the  proof  of  a  residue. 

Example  2.1  In  a  blocks  world,  consider  the  goal  that  some  block  be  on  top 
of  block  B  at  some  future  time  t;.  If  the  current  time  is  T,,  then  the  goal  might 
be  expressed  as  the  formula 


G  =  3t/True(0n(x,B),t/)  A  (</  >  T,). 

3Chakravarthy  [12,14]  has  used  t  he  term  residue  in  a  quite  different  manner  as  will  be  seen  in  Chapter  4. 
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Note  that  the  above  goal  formula  does  not  mention  the  design  explicitly,  but  only  says 
what  must  be  true  at  a  some  time  in  the  future. 

2.2.2  The  World  Model  W 

In  every  design  problem,  it  will  be  assumed  that  there  is  a  consistent  set  W  of  closed  for¬ 
mulas  modeling  the  designer’s  knowledge  of  the  world.  In  future  sections,  there  will  be 
differing  assumptions  made  about  the  allowed  forms  of  the  formulas  in  W,  e.g.,  that  the 
formulas  in  W  are  implicitly  quantified  clauses  or  Horn  clauses.  In  the  literature,  a  dis¬ 
tinction  is  sometimes  made  between  facts  (generally  meaning  ground  atomic  formulas)  and 
rules  (generally  meaning  conditionals,  usually  containing  universally  quantified  variables). 
No  such  distinction  is  made  here. 

The  axioms  of  W  are  assumed  to  represent  a  number  of  different  sorts  of  information, 
the  distiiiction  between  them  being  somewhat  arbitrary: 

•  Operator  Descriptions:  axioinatizaticn  of  the  functionality  of  the  components  from 
which  objects  may  be  synthesized.  This  set  of  components  need  not  be  the  most  basic 
building  blocks,  but  might  be  a  hierarchy  of  possible  components.  In  the  blocks  world, 
W  might  contain  the  the  following  rule  to  describe  a  puton  action: 

Vx,y,t  True(Clear(x).  t)  A 
True(Clear(3/),  t)  A 
x  /  y  A 

True(Handempty.  t)  A 

Execution(puton(x,  y),  t)  D  True(On(x,  y),  t  +  1) 

Similarly,  to  describe  one  state  of  a  NAND-gate  x,  W  might  contain: 

Va,6,c,x,f  NAND-gate(x)  A 
lnput(x,  1,  a)  A 
lnput(x,2,h)  A 
Output(x.  i, c)  A 
True(Value(a,  “1”),  t )  A 

True(Value(6,  “  1  ” ) ,  i )  D  True(Value(c,  “0”),  t) 


•  State  Description:  Facts  about  the  state  from  which  the  object  is  to  be  constructed. 
In  synthesis  of  an  electric  circuit,  one  is  usually  constructing  a  circuit  from  scratch, 
but  for  other  synthesis  problems,  say  a  robot  planning  problem,  a  crucial  part  of  the 
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problem  is  knowing  the  initial  state,  that  is,  the  state  in  which  the  world  will  be  when 
implementation  of  the  design  (i.e.,  execution  of  the  plan)  begins.  In  addition,  if  other 
relevant  state  transitions  will  occur  independent  of  the  execution  of  the  design,  these 
must  also  be  described. 

The  following  formula  might  express  that  block  A  is  initially  on  the  table: 

True(On(A,  Table),  0), 

or  the  fact  that  block  B  is  fragile  might  be  expressed  as: 

Vx,  t  True(On(x,  B),  t )  D  True(Broken(R),  t  +  1). 


•  Design  Rules:  Additional  formulas  may  represent  the  allowed  configurations  of  de¬ 
sign  components,  for  example,  there  might  be  a  prohibition  on  two  inputs  to  the  same 
NAND-gate: 

Va,b,i,x  NAND-gate(x)  A 
lnput(x,i,a)  A 
lnput(x,i,6)  D  a  =  b 

2.2.3  Assumables 

As  discussed  in  Section  2.1,  for  a  design  to  be  useful,  it  must  be  sufficiently  detailed  so  that 
the  intended  implementor  is  capable  of  carrying  it  out.  In  designing  a  house,  “Bolt  board  A 
to  board  Bv  is  a  primitively  achievable  goal;  we  need  not  further  specify  this  operation. 
In  VLSI  layout,  we  assume  that  as  long  as  certain  design  rules  are  followed,  any  shape  of 
conductor  can  be  put  into  the  silicon.4  Again,  there  is  no  need  for  the  task  to  be  specified 
further. 

For  problems  discussed  here,  it  will  be  assumed  that  a  primitively  achievable  instruction 
is  expressed  as  an  atomic  formula,  ground  or  otherwise.  Such  primitively  achievable  instruc¬ 
tions  will  be  called  assimiables.  Furthermore  it  is  assumed  that  there  exists  a  language  A 
of  assumables  and  an  algorithm  that  decides  in  negligible  time  whether  or  not  a  formula 
is  in  A.  In  practice,  the  language  of  assumables  will  be  specified  by  a  set  of  relations  and 
polarities  (i.e.,  negated  or  not),  such  that  any  atomic  formula  with  the  appropriate  polarity 

4  Whether  the  design  works  as  intended  is  a  function  of  its  following  the  “design  rules”  for  that  fabrication 
process.  It  is  assumed  that  such  design  rules  are  part  of  W  and  are  guaranteed  by  the  consistency  requirement 
(discussed  in  Section  2.2.5. 
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and  relation  is  assumable.  As  stated  in  Chapter  1,  it  will  be  convenient  to  assume  that  A 
is  such  that  a  conjunction  A\  A  . . .  A  An  is  assumable  if  each  of  the  A±  is  assumable. 

Example  2.2  In  the  blocks  world  planning  problem,  the  following  schemata 
might  describe  the  language  A  of  assumables: 

Execution  ( <  action> ,  <  time>  ) 

<time>  i  <  <time?> 

<time>i  <  <time  2> 

2.2.4  Definition  of  Design 

Given  the  above  discussion  of  a  goal  G,  a  world  model  W,  and  a  language  A  of  assumable 
formulas,  let  us  define  a  residue  as  follows: 

Definition  2.1  (Residue)  Given  a  consistent  set  W  cf  well-formed  formulas,  a  closed 
formula  G,  and  a  set  of  ivell-formed  formulas  D  =  {Dj, . . . ,  Dn),  D  is  a  legal  design  if 

1.  W,D  |=  G  (Sufficient), 

2.  W  U  D  is  satisfiable  (Consistent),  and 

3.  For  all  D,  £  D,  D,  €  A.  (Assumable). 

The  requirement  of  sufficiency  is  straightforward  —  Given  the  world  model  W,  the 
goal  G  must  be  entailed  by  the  world  model  and  the  design.  The  requirement  of  asstun- 
ubility  has  been  discussed  in  previous  sections.  Following  an  example,  the  requirement  of 
consistency  will  be  discussed  in  Section  2.2.5. 

Consider  a  simple  example  of  a  design  using  only  propositional  calculus. 

Example  2.3  Suppose  the  set  of  assumable  formulas,  facts  of  the  world  model  W, 
and  goal  G  are  as  below: 

A:  A,  B,C,D,E,F 

W:  BAD-*  J 
D  A  E  -  K 
J  A  K  -*M 
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Given  the  above  and  rules,  W  U  {B,  D,E)  M,  as  illustrated  by  the  series 
of  reduction  steps  below: 

M 

I 

J  M< 

I 

B  A  D  A  K 

I 

B  A  D  A  E 

The  set  of  propositions  {B,  D,  E}  is  consistent  with  W,  that  is, 

W  ^  ^(BADAE), 

so  {B,  D,E}  meets  all  three  criteria  of  Definition  2.1,  and  is  a  residue  for  the 
goal  M  given  world  model  W  and  assumables  A. 

2.2.5  Consistency  of  the  Design 

Let  us  assume  that  the  axioms  in  W  accurately  describe  some  portion  of  the  real  world. 
Then,  for  some  D,  if  W  U  D  is  unsatisfiable,  one  must  assume  that  the  D,  cannot  simul¬ 
taneously  describe  the  any  configuration  of  the  real  world.  As  a  minimum  condition  for 
implementing  a  design,  the  consistency  condition  of  Definition  2.1  must  hold. 

Exampl**  2.4  Suppose  W  contains  a  rule  saying  that  “No  two  actions  can  take 
place  simultaneously,”  i.e., 

Va,6,c,x,t  Execution(a,ti)  A 

Execution^, t2)  D  *i  /  <2- 

By  ignoring  this  rule,  one  might  produce  a  plan  for  switching  the  positions  of 
two  blocks  A  and  B  simply  by  saying,  “Move  block  A  to  the  location  of  block  B 
at  time  T”,  and  “Move  block  B  to  the  location  of  block  A  at  time  T”  i.e., 

Execution (Move( A,  B),T)  A  Execution(Move(B,  A),T). 

Given  the  usual  axiomatizations  of  the  Move  operator,  the  goal  logically  follows 
from  a  plan  executing  these  two  actions  simultaneously,  but  there  would  be  no 
way  to  implement  this  plan  in  the  real  world. 
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In  the  above  example,  a  design  that  was  impossible  to  implement  was  disallowed.  The 
consistency  requirement  also  acts  to  enforce  design  rules;  if  the  design  rules  are  not  satisfied, 
there  is  no  guarantee  that  the  object  designed  will  behave  as  expected.  For  example, 
it  is  possible  to  implement  a  VLSI  layout  that  puts  insufficent  space  between  adjacent 
conductors,  but  it  probably  will  not  work  in  the  manner  the  designer  planned. 

One  might  ask,  “Is  there  a  way  for  a  synthesis  system  to  avoid  the  need  for  consistency 
checking  via  careful  axiomatization?”  The  answer  is,  “yes,”  but  only  in  certain  cases.  This 
issue  will  be  discussed  further  in  Section  2.7. 

The  Complexity  of  Consistency  Checking  In  everyday  life,  given  some  flaw  in  a  de¬ 
sign,  we  presume  we  can  eventually  find  it  (given  enough  time  and  assuming  we  are  capable 
of  understanding  the  flaw  and  its  causes),  but  we  cannot  say  that  how  long  it  will  take  to 
find  the  flaw.  In  checking  whether  a  design  is  consistent,  the  situation  is  exactly  analogous. 
Determining  the  satisfiability  of  a  set  of  first-order  formulas  is  a  non-semidecidable  problem. 
In  other  words,  there  is  no  procedure  that  can  take  an  arbitrary  set  of  formulas  and  always 
determine  in  a  finite  amount  of  time  that  the  set  of  formulas  is  satisfiable.  On  the  other 
hand,  showing  that  a  set  of  formulas  is  unsatisfiable  is  semidecidable,  that  is,  there  are 
procedures  guaranteed  to  prove  an  unsatisfiable  set  to  be  unsatisfiable  in  a  finite  amount 
of  time.  So,  if  a  set  of  first-order  formulas  is  inconsistent,5  we  can  eventually  discover  this 
fact,  but  there  is  no  way,  in  general,  to  ever  be  sure  that  a  given  set  of  facts  is  consistent. 

The  impact  of  the  above  phenomenon  for  a  design  problem  is  clear:  either  the  language 
of  W,  G  and  D  must  be  restricted,  or  one  must  settle  for  less  than  perfect  guarantees  of 
consistency.  There  are  numerous  useful  subsets  of  first-order  logic  that  are  decidable,  that 
is,  for  which  there  exists  an  algorithm  for  deciding  in  a  bounded  amount  of  time  whether  or 
not  a  given  formula  is  valid  (and  as  a  result,  whether  a  given  set  of  formulas  is  consistent). 
Besides  the  obvious  example  of  the  propositional  calculus,  Manna  [57],  page  107,  gives  many 
examples  of  other  decidable  subsets  of  the  first  order  predicate  calculus. 

Non-Guaranteed  Consistency  Checks  Although  checking  consistency  of  a  set  of  facts 
in  intractable,  for  a  given  problem  at  the  least  one  hopes  to  find  some  wa.y  to  do  an  accept¬ 
able,  though  imperfect  job  of  checking  that  a  design  will  behave  as  expected. 

For  a  given  W  and  D  it  is  possible  that  all  search  paths  will  be  exhausted  in  trying  to 
show  that  W  ]=  — > D ,  where  D  is  the  conjunction  of  all  D,  G  D.  In  such  a  case,  the  design  D 
is  consistent  with  W.  Failing  this,  one  would  like  to  assume  that  given  an  inconsistent 

J  Inconsistency  and  unsatisfiabilily  are  equivalent  conditions  in  first-order  logic,  as  was  proven  by  Godel 
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Valid  Formula 

True  under  all  interpretations 
(Semidecidable) 

Satisfiable  Formula 
True  under  some  interpretations 
(Non-semidecidable) 


Unsatisfiable  Formula 
True  in  no  interpretations 
(Semidecidable) 


Figure  2:  Valid,  Satisfiable,  and  Unsatisfiable  Formulas  of  First-Order  Logic 

design  D,  the  likelihood  of  proving  the  design  inconsistent  increases  monotonically  with  the 
amount  of  time  spent  in  checking  consistency.  In  other  words,  one  might  want  to  assume 
that  if  no  inconsistency  is  found  after  spending  one  minute  of  cpu  time,  it  is  quite  unlikely 
that  the  design  is  inconsistent;  if  after  two  minutes  inconsistency  has  not  been  proven,  it  is 
even  less  likely  that  the  design  is  inconsistent. 

Given  the  above  assumption,  a  residue  procedure  can  legislate  the  amount  of  time  or 
space  spent  looking  for  inconsistency  before  declaring  the  design  to  be  consistent.  Such 
a  decision  might  also  be  based  on  the  course  of  the  proof,  i.e.,  whether  or  not  it  looks 
“promising”  that  the  design  will  be  proven  inconsistent,  although  that  possibility  will  not 
be  considered  further  here. 

Finally,  for  a  given  problem  it  might  be  reasonable  to  rely  on  some  ad  hoc  method  of 
consistency  checking.  One  may  choose  to  enumerate  the  classes  of  potential  problems  and 
check  only  these  possibilities.  The  checking  can  be  via  arbitrary  procedures  rather  than  by 
any  logical  inference.  The  “critics”  of  Sussman’s  Hacker  [94]  or  Sacerdoti’s  Noah  [81]  are 
typical  of  such  an  approach. 


Example  2.5  Consider  building  a  combinational  circuit  out  of  a  set  of  wires 
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and  NAND-gates.  There  are  a  limited  number  of  ways  the  system  can  go  wrong, 
i.e.,  that  is,  fail  to  act  as  expected  in  spite  of  the  correct  wires  being  in  place: 

1.  There  may  be  a  loop  in  the  circuit. 

2.  There  may  be  more  than  one  output  connected  to  a  single  input. 

The  above  two  design  rules  can  quickly  be  checked,  and  assuming  that  they  are 
not  violated,  the  designer  may  be  willing  to  assume  (or  “legislate”)  that  the 
circuit  will  work  as  expected. 


2.3  Residue  Procedures  for  Design  Synthesis 


The  previous  section  formulated  design  synthesis  as  the  problem  of  finding  residues  for 
a  goal  G,  a  world  model  W,  and  a  language  of  assumables  .4.  The  current  section  and 
the  two  following  describe  two  procedures  for  finding  residues.  This  section  describes  the 
notions  common  to  the  entire  class  of  what  might  be  called  “residue  procedures.”  Section  2.4 
presents  Ordered  Residue,  a  residue  procedure  based  upon  backwards  inference  very  similar 
to  that  done  in  Mycin  [82,9]  and  Prolog  [78].  Ordered  Residue  will  be  used  extensively  in 
later  chapters.  Section  2.5  presents  Resolution  Residue,  a  more  general  residue  technique 
using  binary  resolution  [76]  as  its  backwards  inference  technique.  Appropriate  completeness 
results  are  proven  for  each. 

Figure  3  is  a  high  level  description  of  a  canonical  residue  procedure.  The  central  idea  is 
that  the  original  goal  G  is  reduced  to  other  goals6  via  a  sequence  of  goal  reduction7  steps 
until  an  assumable  goal  is  found,  that  is,  a  goal  for  which  D  6  A.  In  Figure  3,  the  reduced 
goal  is  represented  by  the  symbol  D  to  emphasize  the  fact  that  any  goal  is  potentially  a 
proposed  design.  If,  at  any  time,  the  reduced  goal/design  is  inconsistent  (that  is,  W  [=  -iD), 
then  this  path  can  be  pruned;  further  reduction  steps  cannot  make  the  inconsistent  design 
consistent. 

The  step  “D  <—  Goal  Reduction(D)”  in  Figure  3  is  nondeterministic.  On  each  iteration 
through  the  loop,  the  goal  reduction  step  may  produce  none,  one  or  many  new  goals  D. 

6It  is  tempting  to  say,  “the  goal  G  is  reduced  to  various  subgoals,”  but  the  term  “subgoal”  has  a  very 
specific  (and  different)  meaning  in  logic  programming,  namely,  a  goal  is  a  conjunction  of  literals  and  a 
subgoal  is  one  of  the  conjuncts.  To  avoid  confusion,  the  word  “snbgoal”  will  usually  be  avoided  here.  In  the 
rare  instances  in  which  the  word  “subgoal”  appears,  it  will  refer  to  the  entire  goal  to  which  another  goal  has 
been  reduced. 

'  Coni  reduction  known  by  many  other  names  in  the  literature,  among  them  backwards  inference,  goal- 
directed  reasoning,  subgoaling,  top-down  reasoning ,  goal  regression,  and  consequent  reasoning.  Here,  the 
terms  “goal  reduction”  and  “backwards  inference”  will  be  used  interchangably. 
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Figure  3:  Simplified  View  of  a  Residue  Procedure 
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J  H  A  /  A  J 

K  A L  D  i/ /  \NAPdH 


MD  K 


NAPAIAJ 


I  M  A  L  A  7  A  J 


Figure  4:  Goal  Reduction  Steps 

Each  of  the  new  goals  D  is  a  new  path  in  the  space  of  possible  goal  reductions  from  the 
original  goal  G.  If  for  some  D  no  goal  reductions  are  possible,  that  path  simply  halts  without 
returning  an  answer. 

Example  2.6  Consider  the  goal  G  =  H  A  /  A  J,  where  W  contains  the  axioms 
K  A  L  D  H,  M  D  I(,  and  N  A  P  D  H .  Figure  4  shows  possible  goal  reductions 
that  might  be  made.  Each  node  shown  represents  a  point  in  the  space  searched 
by  the  nondeterministic  step  “D  <—  Goal  Reduction(D).” 

The  “Consistent(D)?”  steps  in  the  flowcharts  describe  a  program  step  solving  an  in¬ 
tractable  problem,  an  obvious  impossibility.  “Consistent D)?”  should  be  understood  to 
refer  to  the  heuristic  approach  (described  in  the  previous  section)  being  used  in  the  particu¬ 
lar  problem  to  approximate  deciding  whether  W  j=  ->D.  Note  that  the  consistency  checking 
step  appearing  in  the  loop  is  not  needed  for  correctness  of  the  procedure,  but  only  to  prune 
goals  that  are  inconsistent  as  soon  as  possible.  In  practice,  one  may  chose  to  eliminate  this 
step  entirely  or  make  it  a  minimal  sort  of  check. 

For  simplicity’s  sake  another  fiction  that  has  been  allowed  in  Figure  3.  Every  reduced 
goal  in  the  figure  springs  from  exactly  one  previously  existing  goal.  It  is  possible,  however, 
to  use  more  than  one  goal  Di, . . .,  Dn  to  find  a  new  goal  D'  such  that  W  U  {D}  |=  G.  Such 
is  the  case  in  Resolution  Residue,  but  the  distinction  is  not  crucial  here. 

The  rest  of  this  chapter  concerns  itself  with  the  goal  reduction  component  of  residue  pro¬ 
cedures  and  is  independent  of  the  consistency  checking,  in  future  chapters,  goal  reduction 
(as  opposed  to  consistency  checking)  will  sometimes  be  called  the  “backwards  component'’ 
of  the  residue  procedure  because  it  consists  of  backwards  inference  from  a  goal  descrip¬ 
tion.  Consistency  checking  will  sometimes  be  referred  to  as  the  “forward  component”  of 
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the  procedure,  since  one  can  think  of  consistency  checking  as  forward  inference  from  the  set 
W  U  {D}  —  if  false  is  derived  from  W  U  {D},  then  D  is  not  a  consistent  design.  As  can  be 
seen  from  Figure  3,  the  forward  and  backwards  components  compete  for  resources.  Chap¬ 
ters  3  and  4  will  generalize  consistency  checking  and  deal  with  the  competition  between  the 
forward  and  backwards  components. 

2.4  Ordered  Residue 

This  section  presents  the  Ordered  Residue  procedure.  In  a  nutshell,  Ordered  Residue  is 
the  residue  procedure  obtained  by  performing  backwards  inference  via  Ordered  Resolution 
on  Head-First  Ordered  Horn  Clauses  under  the  set  of  support  restriction.  All  of  these 
terms  will  be  defined  later  in  this  section.  In  addition,  Ordered  Resolution  under  the  above 
restrictions  is  a  close  relative  of  Prolog  [78]  as  will  be  discussed  in  Section  2.4.4. 

2.4.1  Ordered  Resolution 

Li  the  literature  on  resolution,  clauses  are  variously  defined  as  sets  or  bags  of  literals,  and 
they  may  be  ordered  or  unordered.  For  Ordered  Resolution,8  a  clause  is  a  sequence  of 
distinct  literals,  that  is,  an  ordered  set.  Definitions  2.2-2.4  are  from  Chang  and  Lee  [15]. 

Definition  2.2  (Ordered  Clause)  An  ordered  clause  is  a  sequence  of  distinct  literals. 

Definition  2.3  A  literal  L2  is  said  to  be  greater  than  a  literal  L\  in  an  ordered  clause  (or 
L\  is  smaller  than  L2)  if  and  only  if  L2  follows  L\  in  the  sequence  specified  by  the  ordered 
clause. 

Ordered  Resolution  is  defined  with  Definitions  2.4-2.6. 

Definition  2.4  (Ordered  Factor)  If  two  or  more  literals  (with  the  same  sign)  of  an 
ordered  clause  C  have  a  most  general  unifier  o ,  then  the  ordered  clause  obtained  from  the 
sequence  Co  by  deleting  any  literal  that  is  identical  to  a  sr  Iter  literal  in  the  sequence  is 
called  an  ordered  factor  of  C. 

Definition  2.5  (Ordered  Binary  Resolvent)  Let  C\  and  C2  be  ordered  clauses  with  no 
variables  in  common.  Let  L\  and  L2  be  the  smallest  literals  in  C 1  and  C2,  respectively. 
If  L\  and  ->L2  have  a  most  general  unifier  <7,  and  if  C  is  the  ordered  clause  obtained  by 

fThe  resolution  rule  defined  here  is  a  variant  of  Boyer's  Lock  Resolution  [5],  and  it  differs  significantly 
from  Ordered  Resolution  as  defined  in  Chang  and  Lee  [15],  page  113. 
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concatenating  the  sequences  C\0  and  C^c,  removing  L\0  and  L20 ,  and  deleting  any  literal 
that  is  identical  to  a  smaller  literal  in  the  remaining  sequence,  then  C  is  called  an  ordered 
binary  resolvent  of  C\  against  C?.  The  literals  L\  and  L2  are  the  literals  resolved  upon. 

Definition  2.6  (Ordered  Deduction)  A  clause  C  is  said  to  be  deduced  via  an  ordered 
deduction  from  base  set  S  of  ordered  clauses  if  and  only  if  there  is  a  tree  T  such  that  C  is 
in  T,  every  node  in  the  fringe  ofT  is  a  member  of  S,  and  for  every  other  node  D  either: 

1.  D  has  one  parent  P,  and  D  is  an  ordered  factor  of  P,  or 

2.  D  has  two  parents  P  and  Q,  and  D  is  an  ordered  binary  resolvent  of  P  against  Q. 

Soundness  of  Ordered  Resolution  follows  from  the  set  of  allowed  steps  being  a  subset  of 
the  allowed  steps  in  ordinary  Binary  Resolution.  Treitel  and  Genesereth  [95]  have  proved 
the  following  completeness  theorem: 

Theorem  2.1  (Completeness  of  Ordered  Resolution  on  Horn  Clauses)  [Treitel 
and  Genesereth]  A  set  S  of  ordered  Horn  clauses  is  unsatisfiable  is  and  only  if  there  is 
an  ordered  deduction  of  the  empty  ordered  clause  □  from  S . 

Ordered  Residue  uses  the  set  of  support  restriction,  defined  as  follows: 

Definition  2.7  (Set  of  Support)  /I  subset  T  of  a  set  S  of  clauses  is  called  a  set  of  support 
of  S  if  S  -1'  is  satisfiable.  An  (ordered)  set  of  support  resolution  is  an  (ordered)  resolution 
of  two  clauses  that  are  not  both  from  S  -  T.  An  (ordered)  set  of  support  deduction  is  a 
deduction  in  which  every  (ordered)  resolution  step  is  an  ( ordered )  set  of  support  resolution. 

Unless  otherw'0'’  stated,  the  initial  set  of  support  T  is  understood  to  be  the  set  of  clauses 
from  the  negation  of  the  goal. 

Ordered  resolution  is  complete  for  Horn  clauses,  but  unfortunately  it  is  not  compatible 
with  the  set  of  support  restriction  unless  the  positive  literal  is  always  the  smallest  (first) 
literal  in  the  clause.  Consider  the  following  example: 

Example  2.7  Suppose  we  have  a  database 

{B  A,B} 


and  a  goal  A.  This  would  correspond  to  the  unsatisfiable  set  of  ordered  clans 's 
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and  initial  set  of  support  Ordered  resolution  would  not  be  able  to 

prove  this  set  unsatisfiable  under  the  set  of  support  restriction  since  the  or¬ 
dered  clause  -*B  V  A  does  not  resolve  with  -> A.  If  the  ordered  clause  -> B  V  A 
were  replaced  with  the  ordered  clause  A  V  ->B,  then  there  would  be  an  ordered 
refutation  for  this  set. 

(although  an  ordered  clause  AV  ->B  would  succeed  since  it  can  resolve  with  ->A). 

We  give  the  name  HOH-clause  to  Horn  clauses  with  the  first  literal  being  the  positive 
one. 

Definition  2.8  (HOH-Clause)  A  Head-first  Ordered  Horn  clause  (HOH-clause)  is  an 
ordered  clause  such  that  all  literals  in  the  sequence  are  negative  with  the  possible  exception 
of  the  first,  which  may  be  positive. 

Given  a  formula  such  as  B  A  C  A  D  D  A,  any  (or  all)  of  the  HOH-clauses  below  might 
be  found  in  the  database: 


A  V  -*B  V  -i C  V  ->D 
A  V  ~>B  V  ~iD  V  ~iC 
A  V  ->C  V  -iB  V  ~>D 
A  V  -iC  V  ->D  V  ->B 
A  V  ~>D  V  -i C  V  -iB 
A  V  -'D  V  -<B  V  -tC 

For  Ordered  Resolution  on  HOH-clauses,  the  set  of  support  restriction  preserves  com¬ 
pleteness,  as  has  been  proven  by  Treitel  and  Genesereth  [95]: 

Theorem  2.2  [Treitel  and  Genesereth]  Given  a  goal  G  such  that  the  clauses  of  iG 
contains  only  negative  literals,  a  database  W  of  HOH-clauses,  and  using  the  ordered  clauses 
from  ~>G  as  the  initial  set  of  support,  there  exists  an  ordered  deduction  of  the  null  clause  □ 
from  base  set  W  U  Clauses(G)  and  initial  set  of  support  Clauses(G)  if  and  only  t/W|=G. 

2.4.2  Ordered  Residue  Procedure 

In  order  to  use  ordered  resolution  for  generating  residues,  it  is  necessary  to  somehow 
notice  negated  assumables  as  the  first  literal  and  move  them  so  that  the  assumable  does 
not  prevent  other  literals  in  the  clause  from  being  used  in  subsequent  ordered  resolution 
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steps.  On*5  simple  way  to  accomplish  this  goal  is  to  allow  the  rotation  of  a  negated  as¬ 
sumable  from  the  first  literal  to  being  the  last  literal.  Let  us  define  a  relation  called 
Assumable-Rotation(a,  b)  as  follows: 

Definition  2.9  A  pair  of  ordered  clauses  a  and  b  is  a  member  of  the  Assumable- Rotation 
relation  if  and  only  if  for  ordered  clause  a  =  aj  V  ...  V  am,  ->ai  is  an  assumable,  and 
b  =  02  V  . . .  V  am  V  or. 

In  other  words,  b  has  rotated  the  first  literal  of  a  to  the  rear.  Such  a  rotation  can  not  change 
completeness  of  Ordered  Resolution  since  it  only  adds  to  the  range  of  possibilities.  Such  a 
rotation  also  preserves  soundness,  as  the  meaning  of  a  derived  clause  is  not  a  function  of 
the  clause’s  order.  The  Ordered  Residue  Procedure  may  then  be  defined  as  in  Figure  5. 

The  “Choose  some  C  €  T"  step  has  not  been  fully  specified.  The  implementation  of 
this  step  controls  the  search,  e.g.,  by  always  choosing  the  most  recently  added  member  of  T, 

1  one  obtains  a  depth-first  behavior.  Similarly,  by  always  choosing  the  least  recently  added 

'  member  of  T,  one  obtains  a  breadth-first  search.  Note  also  that  the  clause  C  chosen  is 

I 

i  removed  from  T  after  being  chosen;  there  is  no  further  need  for  it. 

!  2.4.3  Completeness  of  Ordered  Residue 

i  The  completeness  results  of  this  section  and  of  Section  2.5.3  are  expressed  in  terms  of 

one  clause  subsuming  another.  The  definition  of  subsumption  is  the  same  for  ordered  and 
ordinary  clauses,  treating  them  both  as  sets  of  literals. 

i 

|  Definition  2.10  A  clause  C  subsumes  a  clause  D  if  and  only  if  there  is  a  substitution  a 

1  such  that  Co  C  D.  D  is  called  a  subsumed  clause. 

The  main  result  of  this  section  can  now  bo  stated: 

Theorem  2.3  (Completeness  of  Ordered  Residue  on  HOH-clauses) 

Given  a  residue  D  for  world  model  W,  goal  G,  and  assumable  language  A,  wheie 
W  is  a  sntisfiable  set  of  HOII-claxtses, 

G  =  Gj  V  . . .  V  Gm,  where  the  G,  are  conjunctions  of  positive  literals,  and 
D  =  {Dj , . . . ,  D„}  is  a  set  of  atomic  assumables, 

there  exists  an  ordered  set  of  support  deduction  of  a  clause  A  —  -laj  V  ...  V  -<aq  from 
initial  set.  of  support  Clauses(-<G)  such  that  {«i, . . . ,  ar(}  is  a  residue  for  W,  G,  and  A  and 
such  that  A  subsumes  — > D i  V  ...  V  ->Dn. 
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Before  proving  Theorem  2.3,  a  number  of  oilier  results  will  be  helpful.  The  following 
“lifting  lemma”  (as  stated  in  Wos  et  al  [106])  is  the  appropriate  formulation  for  a  resolution 
systems  with  separate  resolution  and  factoring  steps.  It  differs  from  the  common  formulation 
found  in  Chang  and  Lee  [15]. 

Lemma  2.4  (Lifting  Lemma)  [Robinson]  If  A'  and  B'  are,  respectively,  ground  in¬ 
stances  of  (ordered)  clauses  A  and  B  (which  are  assumed  to  have  no  variables  in  com¬ 
mon),  and  if  C'  is  a  (ordered)  resolvent  of  A'  and  B' ,  then  there  exist  (ordered)  clauses  E 
and  F  such  that  an  application  of  (ordered)  binary  resolution  to  E  and  F  yields  a  clause  C, 
where  C'  is  an  instance  of  C,  and  where  E  =  A  or  is  a  factor  of  A  and  F  =  B  or  is  a 
factor  of  B. 

In  addition,  Herbrand’s  Theorem  will  be  used: 

Lemma  2.5  [Herbrand]  A  set  S  of  (ordered)  clauses  is  unsatisfiable  if  and  only  if  there 
exists  a  finite  set  of  ground  instances  of  S  that  is  truth-functionally  unsatisfiable. 

Finally,  the  following  lemma  will  be  useful. 

Lemma  2.6  For  any  formulas  G  and  A,  if  there  exists  a  derivation  of  a  clause  ->A  from 
base  set  W  U  Clauses(-iG),  then  W,  A  b  3G. 

Proof:  Let  us  define  Go  =  Gi  V . .  .VGm,  where  Clauses(-iG)  =  {— -Gx , . . .,  -Gm). 

By  the  soundness  of  resolution,  the  derivation  of  -iA  means  that 


W,-.G0  b  -A.  (2) 

Via  the  deduction  theorem  and  a  series  of  equiavalences  we  get 

W  )=  iG0  D  -.A  (3) 

W  |=  (3A)D(3G0)  (4) 

W,  3A  b  3G0  (5) 

W,  A  |=  3(Gj  V  ...  V  Gm).  (6) 


Formula  (6)  is  the  desired  formula.  It  says  that  there  exists  some  solution  to  G 
for  any  values  of  the  free  variables  of  A.  I 
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Proof  of  Theorem  2.3  Case  I:  (W  3G) 

The  theorem  is  vacuously  true  for  this  case.  Let  D  be  the  null  set.  Then  we  have 

1.  D  subsumes  {} 

2.  W  U  D  is  satisfiable. 

3. WUD  \=  3G. 

Case  II:  (W  £  3G) 

Let  M  =  W  U  D  U  Clauses(-iG).  Since  WUD  j=G,Mis  unsatisfiable.  By  Herbrand’s 
Theorem  (Lemma  2.5),  there  must  exist  some  unsatisifiable  set  Mo  of  ground  instances 
of  M.  Let  Mw  be  the  subset  of  clauses  in  Mo  from  W,  Md  be  the  subset  of  clauses  in 
Mo  from  D,  and  Mg  be  the  subset  of  clauses  in  Mo  from  Clauses(-iG).  Since  W  U  D  is 
satisfiable,  Mw  U  Mg  is  satisfiable  and  Mg  is  non-empty.  By  Theorem  2.2  there  exists  an 
ordered  set  of  support  refutation  Ro  of  Mo  using  Mg  as  the  initial  set  of  support.  Since 
W  ^  3G,  Rc  contains  at  least  one  clause  from  Mq  in  its  fringe. 

Consider  now  the  following  two  lemmas: 

Lemma  2.7  Given  two  ground  ordered  clauses  A\  and  A'2  for  which  C  is  an  ordered  binary 
resolvent,  then  there  exists  an  ordered  binary  resolvent  C  for  ordered  clauses  Ax  and  A2, 
where  A\  is  any  ordered  clause  consisting  of  the  same  sequence  of  literals  as  A\  with  some 
(possibly  empty)  set  Z\  of  ground  literals  anywhere  interspersed  in  its  sequence  except  the 
first  position,  and  A2  is  any  ordered  clause  consisting  of  the  same  sequence  of  literals  as  A'2 
with  some  (possibly  empty)  set  Z2  of  ground  literals  anywhere  interspersed  in  its  sequence 
except  the  first  position.  Furthermore  C  consists  of  the  same  sequence  of  literals  as  C  with 
some  (possibly  improper)  subset  of  Z\  U  Z2  interspersed  in  its  sequence. 

Proof:  Since  the  first  literals  of  A\  and  A',  are  unchanged  in  .4i  and  A2,  .4j 
and  A2  can  be  resolved  as  were  A\  and  A'2.  The  resulting  ordered  clause  C  will 
be  identical  to  C'  except  for  the  additional  negated  literals  introduced  by  A\ 
and  A'2.  Furthermore,  C  -  C'  C  Z\\JZ2  since  some  of  the  literals  introduced  by 
Z\  or  Z2  may  be  deleted  if  an  identical  and  smaller  literal  is  present. 

Lemma  2.8  Given  a  ground  ordered  clause  A!  for  which  C'  is  an  ordered  factor,  then  there 
exists  an  ordered  factor  C  of  ordered  clause  A,  where  A  consists  of  the  same  sequence  of 
literals  as  A!  with  some  (possibly  empty)  set  Z  of  negated  ground  literals  interspersed  in  its 
sequence,  and  C  consists  of  the  same  sequence  of  literals  as  C"  with  some  (possibly  improper) 
subset  of  Z  interspersed  in  its  sequence. 
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Proof:  For  ground  ordered  clauses,  factoring  consists  only  of  deleting  dupli¬ 
cate  literals.  The  literals  of  additional  literals  from  Z  can  not  affect  a  preexisting 
possible  factorization.  I 


Consider  some  a  E  M#,  that  is,  a  ground  atomic  assumable  appearing  in  Rq.  Rq  can 
be  modified  by  eliminating  an  ordered  resolution  of  a  set  of  support  ordered  clause  -ioV5 
against  a  ground  assumable  a.  Instead  of  using  the  resolvent  B,  by  Lemmas  2.7  and  2.8 
a  corresponding  ordered  set  of  support  derivation  R\  can  be  constructed  with  the  negated 
assumable  -ia  left  in  the  tree  and  allowed  to  percolate  toward  the  top.  To  build  the  tree, 
note  that  it  usually  will  be  necessary  to  rotate  negated  ground  assumables  such  as  ->a  to 
the  end  of  ordered  clauses.  Taking  R\,  one  can  repeat  the  process  constructing  R2,  ■  ■  ■ ,  Rp 
until  all  members  of  Mo  have  been  eliminated  from  the  fringe  of  ordered  set  of  support 
deduction  Rp.  Rp  is  then  an  ordered  deduction  of  some  clause  Cp  from  base  set  Mw  U  Me 
such  that  (1)  Rp  consists  only  of  negations  of  ground  atomic  assumables  from  M#,  and  (2) 
Rp  is  a  set  of  support  deduction  with  initial  set  of  support  Mg- 

By  the  Lifting  Lemma  (Lemma  2.4),  Rp  can  be  converted  to  another  set  of  support 
derivation  D *  such  that  the  fringe  consists  of  ordered  clauses  from  W  U  Clauses(->G).  The 
root  of  Dp  is  given  by  Cv9~x  —  -1  a\  V  for  some  substitution  9 ,  where  the  a\  are 

atomic  formula  from  D  with  some  of  the  constants  in  these  formulas  possibly  replaced  by 
variables.  Thus  for  each  a$,  there  exists  some  substitution  <fi  such  that  a '<£  E  D. 

So, 

1.  -1  a\  V  . . .  -1  a'  subsumes  -iDj  V  . .  .->Dn 

2.  W  U  {a'j , . . . ,  a^}  is  satisfiable  since  it  has  a  satisfiable  instance  Mjy  U  Mp 

3.  W  U  {ai, . . .  ,a'q}  f=  3G  by  Lemma  2.6. 

I 


2.4.4  Relation  to  Prolog 

As  stated  earlier,  the  set  of  steps  allowed  by  Ordered  Resolution  is  closely  related  to  the 
set  of  steps  made  by  a  Prolog  Interpreter  [78].  Looking  at  the  search  space  of  Prolog  as 
an  AND-OR  tree,  with  conjunctions  of  literals  to  solve  via  disjunctions  of  possible  ways 
to  reduce  the  literals,  Prolog  considers  both  conjunctions  and  disjunctions  in  depth-first, 
fashion.  In  other  words,  given  a  goal  (that  is,  a  conjunction  to  be  solved) 
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Prolog  will  completely  solve  (either  find  an  answer  for  or  else  fail  on)  A  before  considering 
B.  Given  rules  (that  is,  a  disjunction  of  possible  ways  to  solve  the  conjunct  A) 

A  :  -  E,F,G 
A  :  —  H,  I,  J,  K 
A  :  -  L,M,N, 

Prolog  will  also  exhaust  possibilities  for  finding  answers  to  A  via  the  first  of  the  rules  before 
considering  the  second.  Thus,  given  a  goal  and  an  ordering  of  the  rules  in  the  database, 
Prolog  specifies  precisely  the  order  in  which  inference  steps  may  be  made. 

Ordered  Resolution  is  similar  to  Prolog  in  handling  goal  conjuncts  in  order,  but  differs 
from  Prolog  in  not  specifying  the  order  in  which  various  rules  may  be  applied.  It  will  turn 
out  that  this  depth -first  ordering  on  conjuncts  is  important  to  the  techniques  of  Chapters  3 
and  4,  but  there  is  no  need  in  this  work  to  specify  the  order  in  which  possible  goal  reductions 
on  the  same  conjunct  are  tried. 


2.5  Resolution  Residue 

Of  the  many  inference  techniques  one  might  use  as  the  backwards  inference  engine  of  residue, 
perhaps  the  most  obvious  one  is  binary  resolution  (Robinson  [77]).  Binary  resolution  (usu¬ 
ally  just  called  “resolution”  for  simplicity)  has  been  greatly  studied  for  two  decades  and 
is  very  well  understood.  In  addition,  it  is  easily  implemented,  and  is  refutation  complete. 
To  use  resolution  as  a  backwards  inference  technique,  the  set  of  allowed  resolutions  must 
be  restricted  since  unrestricted  resolution  allows  many  more  inferences  than  just  backwards 
inference  steps.  Fortunately,  one  of  the  best  known  restrictions  on  resolution,  the  set  of 
support  restriction  (Wos  et  al  [105]),  is  exactly  that  —  a  restriction  on  resolution  allowing 
only  backwards  inference  steps.  As  shown  by  Wos,  et  al,  the  support  restriction  preserves 
refutation  completeness.  We  might  informally  say  that  this  results  means  that  that  one  can 
walk  a  given  search  path  from  the  goal  to  the  initial  state  just  as  well  as  from  the  initial  to 
the  goal  state. 

The  remainder  of  this  section  explores  using  resolution  under  the  set  of  support  restric¬ 
tion  as  the  sole  backwards  inference  technique  in  residue.  The  procedure  so  derived  will  be 
called  Resolution  Residue  and  is  illustrated  in  Figure  6. 
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Figure  6:  The  Resolution  Residue  Procedure 
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2.5.1  Definitions 

The  definitions  below  are  one  standard  formulation  for  binary  resolution,  differing  from 
Robinson’s  original  formulation  in  having  separate  factoring  and  resolution  steps.9  As 
contrast  to  Ordered  Resolution,  clauses  are  treated  here  as  sets  of  literals  rather  than 
sequences.  Somewhat  different  presentations  of  resolution  and  the  set  of  support  restriction 
may  be  found  in  texts  by  Chang  and  Lee  [15],  Manna  [57],  and  Wos,  et  al  [106]. 

Definition  2.11  (Factor)  If  two  or  more  literals  (with  the  same  sign)  of  a  clause  C  have 
a  most  general  unifier  o,  then  Ccr  is  called  a  factor  of  C.  If  Co  is  a  unit  clause,  it  is  a 
called  a  unit  factor  of  C. 

Definition  2.12  (Binary  Resolvent)  Let  C\  and  Ci  be  two  clauses  (called parent  clauses ) 
with  no  variables  in  common.  Let  L\  and  Li  be  two  literals  in  C\  and  Ci,  respectively.  If 
L\  and  -iLi  have  a  most  general  unifier  o,  then  the  clause 

(Cio  -  L\o)  U  ( Cio  -  Lio) 

is  called  a  binary  resolvent  of  Ci  and  Ci.  The  literals  L\  and  Li  are  called  the  literals 
resolved  upon. 

Binary  resolvents  usually  are  simply  called  resolvents. 

Definition  2.13  (Deduction)  Given  a  set  S  of  clauses,  a  (resolution)  deduction  of  C 
from  S  is  a  finite  sequence  C\,Cj, .  ■  -,Ck  of  clauses  such  that  each  Ci  is  either  a  clause 
in  S,  a  binary  resolvent  of  two  clauses  preceding  Ci  or  a  factor  or  a  clause  preceding  Ci .  A 
deduction  of  the  null  clause  □  from  S  is  called  a  refutation  of  S. 

Theorem  2.9  states  Robinson’s  well-known  result  that  resolution  is  sound  and  refutation 
complete.  It  is  given  here  without  proof.10 

Theorem  2.9  [Robinson]  (Completeness  of  the  Resolution  Principle)  A  set  S  of 
clauses  is  unsatisfiable  if  and  only  if  there  is  a  deduction  of  the  empty  clause  □  from  S. 

9The  definitions  here  are  similar  to  Loveland’s  [52]  formulation  of  resolution. 

'“Because  the  formulation  of  resolution  differs  from  Robinson,  one  must  be  careful  in  claiming  his  sound¬ 
ness  and  completeness.  The  formulation  here  defines  factors  and  binary  resolvants  identically  to  Chang  and 
Lee,  but  differs  from  Chang  and  Lee’s  definition  of  deduction  in  allowing  separate  factoring  steps  rather 
than  only  combined  factoring-resolution  steps.  If  one  is  willing  to  rely  on  the  completeness  result  proven 
in  Chang  and  Lee,  it  is  clear  that  allowing  a  superset  of  Chang  and  Lee’s  steps  cannot  hurt  completeness. 
In  addition,  since  factoring  steps  are  sound,  soundness  is  preserved  as  well.  Actually,  the  formulation  of 
resolution  here  is  identical  to  that  of  Loveland  [52]  in  which  clauses  are  sets  of  literals,  and  resolution  and 
factoring  steps  are  separate. 
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Theorem  2.10  is  an  important  result  from  Wos  et  al  [105],  that  of  the  completeness  of 
the  set  of  support  strategy  (Definition  2.7):11 

Theorem  2.10  [Wos]  (Completeness  of  the  Set  of  Support  Strategy)  If  S  is  an 
unsatisfiable  set  of  clauses  and  T  C  S  such  that  S  —  T  is  satisfiable,  then  there  exists  a 
refutation  of  S  with  set  of  support  T. 

2.5.2  The  Resolution  Residue  Procedure 

Figure  6  gives  the  procedure  for  Resolution  Residue,  with  Resolvent (C,  D,  E)  meaning 
that  E  is  a  binary  resolvent  of  clauses  C  and  D  according  to  Definition  2.12  and  Factor(C,  E) 
meaning  that  E  is  a  factor  of  clause  C  according  to  Definition  2.11.  As  illustrated  in  Fig¬ 
ure  6,  Resolution  Residue  follows  the  set  of  support  restriction  on  an  initial  set  of  support 
of  clauses  from  -iG. 

2.5.3  Completeness  of  Resolution  Residue 

This  section  closely  follows  the  proof  of  completeness  of  Ordered  Residue  on  HOH-clauses 
in  Section  2.4.3.  The  completeness  theorem  for  resolution  residue  is  as  follows: 

Theorem  2.11  (Completeness  of  Resolution  Residue) 

Given  a  residue  D  for  world  model  W,  goal  G,  and  assumable  language  A,  where 
W  is  a  satisfiable  set  of  clauses, 

G  =  Gi  V  . . .  V  Gm,  where  the  G,  are  conjunctions  of  literals,  and 
D  =  {D1,...,Dn}  is  a  set  of  atomic  assumables, 

there  exists  a  set  of  support  deduction  of  n  clause  A  =  -i«i  V  ...  V  -i aq  from  initial  set 
of  support  Clauses(->G)  such  that  {ai,...,a?}  is  a  residue  for  W,  G,  and  A  and  such  that 
A  subsumes  — i D j  V  ...  V  -'Dn. 

Proof  Case  I:  (W  (=  3G) 

The  theorem  is  vacuously  true  for  this  case.  Let  D  be  the  null  set.  Then  we  have 

1.  D  subsumes  {} 

2.  W  U  D  is  satisfiable. 

3.  W  U  D  \=  3G. 

"The  wording  here  is  from  Loveland  [5'2] . 
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Case  II:  (W  £  3G) 

Let  M  =  W  U  D  U  Clauses(-iG).  Since  W  U  D  G,  M  is  unsatisfiable.  By  Herbrand’s 
Theorem  (Lemma  2.5),  there  must  exist  some  unsatisifiable  set  Mo  of  ground  instances 
of  M.  Let  Mvk  be  the  subset  of  clauses  in  Mo  from  W,  Mg  be  the  subset  of  clauses  in 
M0  from  D,  and  Mg  be  the  subset  of  clauses  in  Mo  from  Clauses(-iG).  Since  WuD  is 
satisfiable,  Mw  U  Md  is  satisfiable  and  Mg  is  non-empty.  By  Theorem  2.10  there  exists  a 
set  of  support  refutation  Ro  of  Mo  using  M a  as  the  initial  set  of  support.  Since  W  ^  BG, 
Ro  contains  at  least  one  clause  from  Mg  in  its  fringe. 

Consider  now  the  following  two  lemmas: 

Lemma  2.12  If  ground  clause  C'  is  a  resolvent  of  ground  clauses  A\  and  A'2  and  A\  and  A2 
are  such  that  A\  =  A\  U  Z\  and  A2  =  A2  U  Z2  for  sets  Z\  and  Z2  of  ground  literals,  then 
there  exists  a  ground  literal  C  =  C'  U  U  Z2  such  that  C  is  a  resolvent  of  A\  and  A2. 

Proof:  A\  and  A2  can  be  resolved  using  the  same  literals  as  was  used  in 
the  resolution  of  A\  and  A2.  The  resulting  clause  C  will  be  contain  every  literal 
of  C'  and  in  addition,  will  contain  any  extra  literals  from  Z\  and  Z2.  I 

Lemma  2.13  Given  a  ground  clause  C  that  is  a  factor  of  ground  clause  A' ,  then  there 
exists  a  factor  C  =  C'  U  Z  of  ground  clause  A  =  A'  U  Z,  where  Z  is  some  (possibly  empty) 
set  of  ground  literals. 

Proof:  For  ground  clauses,  factoring  consists  only  of  deleting  duplicate  liter¬ 
als.  The  literals  of  additional  literals  from  Z  can  not  alfect  a  preexisting  possible 
factorization.  I 


Consider  some  a  £  Mo,  that  is,  a  ground  atomic  assumable  appearing  in  Ro-  Ro  can 
be  modified  by  eliminating  a  resolution  of  a  set  of  support  clause  ->a  V  B  against  a  ground 
assumable  a.  Instead  of  using  the  resolvent  B,  by  Lemmas  2.12  and  2.13  a  corresponding 
set  of  support  derivation  Ri  can  be  constructed  with  the  negated  assumable  ->a  left  in 
the  tree  and  allowed  to  percolate  toward  the  top.  Taking  R\,  one  can  repeat  the  process 
constructing  R2, . . . ,  Rp  until  all  members  of  Mg  have  been  eliminated  from  the  fringe  of 
ordered  set  of  support  deduction  Rp.  Rp  is  then  a  deduction  of  some  clause  Cv  from  base 
set  Mw  UMd  such  that  (1)  Rv  consists  only  of  negations  of  ground  atomic  assumables 
from  Mg,  and  (2)  Rv  is  a  set  of  support  deduction  with  initial  set  of  support  Mg- 

By  the  Lifting  Lemma  (Lemma  2.-1),  Rv  can  be  converted  to  another  set  of  support 
derivation  D *  such  that  the  fringe  consists  of  clauses  from  W  U  Clauses(-iG).  The  root 
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of  D*  is  given  by  Cp6~l  =  ->a\  V  . . .  ~^a'q  for  some  substitution  6,  where  the  a[  axe  atomic 
formula  from  D  with  some  of  the  constants  in  these  formulas  possibly  replaced  by  variables. 
Thus  for  each  a[,  there  exists  some  substitution  <f>  such  that  a[4>  €  D. 

So,  we  have 

1.  -i a\  V  . . .  -i a'  subsumes  — >Di  V  . . .  -iD„ 

2.  W  U  {a^, . .  .,a'q}  is  satisfiable  since  it  has  a  satisfiable  instance  Mw  U  Mp 

3.  W  U  {«i, . . .  ,a'q]  |=  3G  by  Lemma  2.6. 


2.6  Residue  with  Answer  Extraction 

Given  a  residue  D  for  a  goal  G,  we  know  that  the  union  WUD  is  sufficient  to  entail  G. 
But,  if  G  contains  existentially  quantified  variables,  it  is  sometimes  desirable  to  know  for 
what  value  of  these  variables  the  goal  has  been  proven. 

Example  2.8  The  goal  “Put  a  block  on  top  of  block  A”  might  be  stated  as  the 
goal 

G  =  3x,  t  True(On(x,  A),  t)  A  (t  >  T0). 

Given  some  world  model  W  and  residue  D,  one  knows  that  some  block  can 
be  placed  on  block  A,  but  without  knowing  which  block  x  or  at  what  time  t. 
Knowing  the  value  of  t,  x,  or  both  might  be  important. 

It  is  not  always  the  case  that  there  is  a  single  variable  binding  that  can  be  given  as  the 
value  of  an  existential  variable.  Suppose  a  database  contains  the  axiom 


one  is  given  the  query 


Dog(Fido)  V  Dog( Rover), 


3x  Dog(x). 


It  can  be  proven  that  there  is  a  dog,  but  it  is  impossible  to  say  more  than  “Either  Fido  is 
a  dog  or  Rover  is  a  dog,”  that  is,  x  =  Fido  V  x  =  Rover.  Such  disjunctive  variable  bindings 
are  called  indefinite  answers  [75]  in  the  database  literature. 

Cordell  Green’s  method  of  answer  literals  [30,31]  is  well  known  for  extracting  such 
answers  from  resolution  refutations.  For  each  clause  of  the  negation  of  the  goal,  the  literal 
Ans(x)  is  added,  where  x  is  an  existentially  quantified  variable  of  G  whose  value  is  of  interest. 


J 
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If  there  exists  a  resolution  refutation  T  of  iG,  then  a  corresponding  deduction  tree  T  can 
be  built  starting  with  G  D  Ans(x),  that  is,  with  the  answer  literal  Ans(x)  added  to  each 
clause  in  the  negation  of  G.  The  root  of  T'  is  a  clause 

Ans(ai)  V  ...  V  Ans(a„), 

where  it  can  be  shown  that  W  {=  G  for  x  equal  to  at  least  one  of  the  a^.12  If  there  is  a 
definite  binding  for  x  =  a  for  which  W  (=  G  (as  proven  by  refutation  T),  then  a  unit  clause 
Ans(a)  will  be  the  root  of  T' . 

David  Luckham  and  Nils  Nilsson  [54]  found  a  slight  generalization  of  Green’s  method  in 
which  they  substitute  the  tautology  C;  V  ~>C{  for  each  clause  Ci  of  -iG.  Building  a  deduction 
tree  via  the  same  set  of  resolution  and  factoring  steps  as  in  the  refutation  of  — >G,  Luckham 
and  Nilsson  deduce  a  disjunction  of  one  or  more  formulas  -■  Ci,  but  with  the  appropriate 
variables  filled  in  with  values  used  in  the  refutation.  Since  each  such  expression  ->Ci  is  an 
instance  of  the  goal  (or  a  disjunct  of  the  goal),  one  can  extract  appropriate  values  for  each 
variable  of  interest.  Luckham  and  Nilsson  show  that  the  answers  they  obtain  are  at  least  as 
general  as  those  of  Green  (in  the  sense  of  binding  variables  with  as  little  restriction  on  their 
values  as  possible).  In  addition,  for  considering  values  for  several  variables,  Luckham  and 
Nilsson’s  method  returns  disjunctions  of  bindings  for  each  of  the  variables.  In  other  words, 
if  existential  variables  ,t,  y  and  z  appear  in  G,  Luckham  and  Nilsson’s  method  extracts  a 
disjunction  of  triples  of  values,  {a-i,  t/i,  zi}  V  ...  V  {xn,yn,zn}  rather  than  a  disjunction  of 
values  for  each  variable  separately.  See  Nilsson  [72]  for  a  more  detailed  exposition  of  this 
method. 

Given  an  answer  extraction  method  such  as  Green’s  of  Luckham  and  Nilsson’s,  we 
might  ask  whether  or  not  such  a  method  can  be  used  to  find  values  for  goals  to  a  residue 
procedure.  The  proofs  will  not  be  presented  here,  but  neither  Green’s  nor  Luckham  and 
Nilsson’s  answer  extraction  techniques  depend  upon  the  deduction  being  a  refutation  (that 
is,  a  deduction  of  the  null  clause  □),  and  thus,  either  method  is  applicable  to  both  Ordered 
Residue  and  Resolution  Residue. 

For  Ordered  Residue,  all  extracted  answers  will  be  definite  as  well.  Indefinite  answers  in 
resolution  proofs  can  occur  only  when  two  clauses  in  the  set  of  support  are  resolved  against 
each  other.  Since  all  clauses  in  the  initial  set  of  support  have  only  negative  literals,  and 
resolution  against  HOH-clauses  preserves  this  condition  on  the  set  of  support,  then  no  two 
clauses  in  the  set  of  support  can  ever  be  resolved  against  each  other. 

1JAs  first  suggested  by  Waldinger  (49j,  indefinite  answers  can  be  avoided  by  generating  a  conditional 
whenever  two  clauses  that  have  the  same  answer  literals  are  resolved.  Waldinger  and  Lee’s  PROW  [99] 
generated  conditionals  in  this  fashion,  and  Green  later  incorporated  conditional  generation  in  QA3. 
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2.7  Discussion 

The  previous  sections  of  this  chapter  presented  both  an  approach  to  design  and  two  residue 
procedures  for  generating  design  descriptions.  In  this  section  the  residue  approach  is  com¬ 
pared  and  contrasted  to  previous  work  in  deductive  synthesis. 

2.7.1  The  Single- Term  Approach 

In  1963  John  McCarthy  [66]  proposed  the  situational  calculus ,  that  is,  the  representation 
of  actions  as  a  mapping  from  a  state  to  a  state,  or  a  situation  to  a  situation  in  McCarthy’s 
terminology.  Within  the  next  two  years  a  system  by  W.  S.  Cooper  [18]  and  James  Slagle’s 
DEDUCOM  [84]  weie  published,  these  being  first  attempts  at  answering  database  queries 
via  a  proof  that  the  answer  exists.  The  year  1969  saw  a  synthesis  of  these  approaches  in 
Cordell  Green’s  QA3  [32,31]  and  Waldinger  and  Lee’s  PROW  [99,98]. 

Both  QA3  and  PROW  derived  designs  by  representing  the  design  as  a  term  about 
which  the  design  specification  can  be  proven  to  hold.  Their  approach  is  called  the  single¬ 
term  approach  here,  and  most  previous  work  in  deductive  synthesis  falls  into  this  category. 
In  the  single-term  approach,  the  proof  procedure  binds  the  term  to  an  expression  from 
which  the  design  can  be  ascertained.  This  expression  might  be  a  constant  (that  is,  a  known 
situation  in  which  the  design  specification  holds)  or  more  likely,  a  function  (that  is,  is  a 
composition  of  situation  to  situation  transformations  by  which  the  desired  situation  can  be 
reached).  Consider  the  following  example: 

Example  2.9  To  find  out  how  to  get  block  A  on  top  of  block  B,  the  system 
proposes  the  goal  formula 

3  d  ON(A,  B,  d) 

and  proves  that  this  formula  follows  from  the  axioms  describing  the  world.  The 
situation  d  for  which  ON(A,  B,d)  will  be  bound  to  a  description  of  a  plan  to 
achieve  such  a  goal.  Let  us  assume  that  block  A  is  clear  and  that  block  C  is 
on  top  of  block  B  in  the  initial  situation  So-  Then,  one  situation  d  for  which 
ON(A,  B,  d)  holds  is 


d  =  PUTON(A.  B,  PUTON(C, Table, S0)), 
where  the  PUTON  action  is  described  by: 


V«,h,s  CLEAR(«,.s)  A  CLEARS,  .s)  D  0N(«,  6,  PUTONfn, 6,  *)). 
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We  can  find  the  design  from  the  binding  of  d,  that  is,  one  puts  A  on  B  in  the 
situation  attained  by  putting  C  on  the  Table  in  So-13 

In  the  single-term  approach  the  problem  is  two-fold:  (1)  proving  that  the  goal  formula 
follows  from  the  axioms  describing  the  world,  and  (2)  constructing  the  “answer,”  i.e.,  ex¬ 
tracting  the  binding  of  the  situation  d  for  which  the  goal  formula  holds.  Just  as  for  the 
residue  approach,  any  number  of  different  proof  procedures  might  be  used.  A  number  of 
different  methods  have  been  developed  for  answer  extraction.  Green  [32]  originally  proposed 
the  idea  of  an  answer  literal  (See  Section  2.6),  an  idea  that  was  generalized  by  Luckham 
and  Nilsson  [54]  in  1971  and  by  Manna  and  Waldinger  [58]  in  1980. 14 

2.7.2  Problems  of  Expression 

As  outlined  in  the  above  section,  the  single-term  approach  represents  designs  as  a  single 
term,  that  is,  a  composition  of  transformations  of  one  situation  to  another.  For  problems 
in  which  backtracking  through  a  space  of  state  transformations  provides  adequate  perfor¬ 
mance,  the  single-term  approach  works  quite  well.  For  other  problems,  the  need  to  specify 
the  design  as  a  set  of  state  transformations  causes  difficulty;  a  finer-grained  set  of  constraints 
is  desirable  for  specification  of  a  design.  Residues,  that  is,  sets  of  atomic  formulas,  provide 
a  richer  language  of  constraints.  Let  us  look  at  some  of  the  difficulties  encountered  in  the 
single-term  approach  and  how  they  are  handled  in  the  residue  approach. 

Implicit  Linear  Ordering  A  term,  being  a  composition  of  functions,  implies  a  single 
linear  order  of  application  of  the  transformations.  This  ordering  has  been  used  in  various 
domains  to  specify  the  ordering  of  components  in  the  design.  For  example,  in  planning, 
the  innermost  function  specifies  the  first  action  taken,  the  next  innermost  function  specifies 
the  next  action,  and  so  on.  Similarly,  in  program  synthesis,  the  composition  of  functions 
specifies  application  of  functions  in  a  functional  programming  language  such  as  LISP.  In 
circuit  design,  each  function  represents  the  output  of  a  circuit  component  (such  as  a  resistor) 
whose  arguments  are  in  turn  functions  specifying  the  inputs  to  that  component. 

13This  example  is  a  rough  approximation  of  Green’s  methodology ;  Green  actually  developed  and  used 
answer  literals  (Section  2.6)  to  provide  indefinite  (disjunctive)  answers  in  situations  where  an  answer  provably 
exists,  but  it  is  impossible  to  say  what  the  answer  is.  Even  so,  each  potential  answer  is  represented  as  a 
single  term. 

14 On  the  surface,  Manna  and  Waldinger’s  approach  does  not  seem  to  extract  the  answer  as  the  binding 
of  a  term,  but  their  “Output  Column’’  can  be  viewed  as  a  means  to  more  freely  rewrite  the  design  term.  In 
addition,  authors  such  as  Wos  et  al  [  1 0(>]  represent  designs  via  a  fixed  number  of  terms,  but  this  does  not 
change  the  fundamental  nature  of  the  discussion  to  follow.  The  issue  is  not  one  or  many  terms,  but  the  use 
of  terms  rather  than  formulas. 
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If  one  chooses  to  build  a  specification  in  strictly  backwards  or  strictly  forward  order,  the 
above  approach  presents  no  problem.  Each  intermediate  specification  can  be  specified  by  a 
composition  of  functions.  On  the  other  hand,  one  cannot  easily  insert  components  into  the 
middle  of  such  a  sequence.  An  intermediate  design  sucl  as 

PUTON(A,  B,  PUTON(B,  C,s)) 

does  not  leave  room  for  specification  anywhere  except  at  the  innermost  position. 

Furthermore,  because  first-order  logic  does  not  allow  quantification  over  functions,  one 
cannot  “leave  an  empty  space”  in  the  sequence  of  compositions.  The  expression 

PUTON(A,B,/(a1,...,an,PUTON(B,C,s))) 

is  not  a  term  of  first-order  logic.  (In  addition,  one  has  no  idea  of  how  many  arguments  the 
function  /  will  have.) 

Residues,  on  the  other  hand,  specify  constraints  in  no  particular  order.  The  presence 
(or  lack  thereof)  of  a  particular  component  can  be  expressed  independently  of  its  temporal 
or  physical  location  in  the  design.  As  a  result,  there  is  no  difficulty  in  specifying  partial 
orderings,  or  in  inserting  a  component  between  two  others. 

Example  2.10  By  assuming  the  formulas 

Execution(Puton(A,  B),  Ti)  and  Execution(Puton(B,  C), T2) 

a  residue  procedure  can  easily  state  that  actions  Puton(A,B)  and  a  Puton(B,C) 
are  in  a  plan  without  stating  in  what  order  they  will  be  executed.  On  the  other 
hand,  the  single-term  approach  has  no  obvious  way  to  do  the  same;  it  must 
decide  in  what  order  the  actions  are  to  take  place  via.  the  design  term 

Puton(B,C,Puton(A,B,S0)) 

or  else  the  term 

Puton(A,B,Puton(B,C,S0)). 


Specification  of  Components  via  Functions  The  single-term  approach  relies  upon 
a  set  of  state  transformation  functions  to  specify  a  design.  As  mentioned  in  the  previous 
paragraph,  first-order  logic  does  not  permit  quantification  over  functions,  so  the  expression 


PUT0N(A,B,  /(«,,...,  a„,PUTON(B,C,s))) 
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is  not  a  term  of  first-order  logic.  Once  a  state  transformation  function  has  been  added  to  a 
term,  it  cannot  be  changed  —  there  is  no  way  to  leave  the  state  transformation  a  variable 
about  which  other  constraints  are  specified. 

Expressed  as  a  residue,  design  components  are  terms  rather  than  functions.  For  example, 
the  formula 

Execution(a,t) 

might  specify  that  action  a  is  to  be  executed  at  time  t.  One  can  certainly  quantify  over  the 
set  of  actions  a  (and  t,  too)  and  specify  additional  constraints  upon  a  such  as 

-iMoves-Blocks(a). 

Combining  Constraints  Residues  specify  designs  via  an  implied  conjunction  of  the  set 
of  formulas  in  the  residue.  Single  terms,  on  the  other  hand,  cannot  depend  upon  the 
semantics  of  logical  conjunction. 


Example  2.11  Consider  the  problem  of  stating  that  action  A  should  take  place 
more  than  3  but  less  than  8  seconds  after  action  B.  A  residue  procedure  can 
state 


Execute(A,Ti) 

Execute)  B,  T2) 

Tj  +  8  >  T 2 

Tj  +  3  <  T2, 

In  order  to  do  the  same  with  a  state  transformation,  one  might  invent  a  new  func¬ 
tion  such  as  Delay(ti,t2,s)  that  maps  a  state  into  an  identical  state  somewhere 
between  t\  and  t2  seconds  later  than  the  original.  The  above  plan  fragment 
could  then  be  expressed  as  B(Delay(3,8,A(So))). 

Suppose  one  then  decides  that  action  B  must  also  be  more  than  4  seconds  after 
action  A.  In  the  residue  approach,  one  simply  adds  another  constraint  to  the 
design,  namely  h  +4  <  t2.  In  the  single-term  approach,  even  though  the  function 
Delay  has  already  been  invented  there  is  still  a  problem.  Simply  composing  this 
additional  constraint  to  give 


Delay(4,8,B(Delay(3,8,A(5o)))) 
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is  not  the  correct  state  transformation.  Instead,  the  system  must  somehow 
unpack  the  original  composed  function 

B(Delay(3,8,A(50))) 

and  know  how  to  transform  it  to  the  desired  term 

B(Delay(4,8,A(S0))). 

One  might  imagine  such  a  program  for  taking  a  term  (containing  some  fixed  vocabulary) 
and  an  additional  constraint  and  from  these  two  elements  outputting  a  new  term  incorpo¬ 
rating  that  constraint,  but  to  do  so  is  ad  hoc,  at  best.  In  fact,  in  order  to  make  the  above 
transformation,  the  system  probably  would  have  to  unpack  the  term  to  something  closely 
resembling  a  set  of  formulas. 

In  contrast,  by  expressing  constraints  on  a  design  as  a  set  of  formulas,  there  is  no  need  for 
rewriting  the  entire  design  every  time  that  a  new  constraint  is  added.  Instead  of  a  problem 
to  be  solved  in  an  ad  hoc  and  probably  awkward  way,  there  is  a  simple  way  to  add  an 
additional  design  constraint,  namely  set  union,  and  there  already  exists  a  well  understood 
semantics  for  the  language  in  which  the  design  is  expressed. 

The  same  sort  of  problems  arise  in  trying  to  express  a  partial  ordering  of  actions,  con¬ 
straints  on  the  values  of  components  (restriction  on  the  voltage  drop  across  a  given  resistor, 
or  restrictions  on  the  allowed  color  or  shape  of  a  block,  for  example).  Such  constraints  fit 
poorly  into  the  single-term  approach  because  they  are  not  state  transformations;  they  are 
facts  about  the  design  being  constructed. 

2.7.3  Reasoning  about  Partial  Designs 

Much  of  the  rest  of  the  thesis  will  concern  itself  with  reasoning  about  a  partially  completed 
design  during  the  design  process.  Residue  procedures  depend  upon  their  ability  to  do 
consistency  checking.  Chapters  3  and  4  will  require  additional  reasoning  about  a  design. 

Terms  of  a  logic  are  indivisible  objects;  one  can  state  facts  about,  a  whole  term,  but 
not  about  a  part  of  a  term.  Thus,  in  order  to  reason  about  a  design  expressed  as  a  single 
term,  the  composed  functions  expressing  the  design  must  be  unpacked  into  a  set  of  facts 
about  which  we  can  reason.  In  the  residue  approach  this  is  obviously  already  the  case  — 
all  information  about  the  design  are  already  expressed  as  a  set  of  formulas;  there  is  no  need 
to  transform  the  design  into  a.  form  on  which  a.  system  can  reason. 
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2.7 A  Minimal  Answers 

It  is  not  always  the  case  that  a  completely  specified  design  is  needed.  If  order  of  two  actions 
in  a  plan  is  immaterial,  it  is  good  to  know  this  fact.  By  keeping  options  open,  one  allows 
choices  to  be  made  later  when  more  information  might  be  known,  and  one  allows  for  more 
possibilities  for  implementing  the  assumables  that  are  already  specified.  In  the  planning 
problem  of  Example  2.10,  for  example,  if  the  order  of  the  execution  of  the  actions  really 
did  not  matter,  a  parallel  implementation  would  be  possible.  As  a  residue  would  not  need 
to  specify  the  ordering  of  the  actions,  the  possibility  of  parallel  execution  would  fall  out  of 
the  residue  approach.15  As  long  as  the  parts  of  the  design  needed  to  prove  sufficiency  can 
be  expressed  as  atomic  formulas,  there  is  no  need  for  a  residue  to  specify  more. 

2.7.5  Mimicking  the  Single- Term  Approach  with  Residues 

II  it  is  desired  to  a  residue  procedure  in  a  manner  similar  to  the  single-term  approach, 
it  is  easy  to  do  so.  Suppose  that  instances  of  the  relation  EXECUTION  are  assumable. 
EXECUTION^,  s;,.s/)  means  that  action  a  will  be  executed  in  situation  s,  transforming  the 
world  to  sitation  sj.  Writing  axioms  about  state  transformation  operators  in  the  form 

Prereqs(a,s,)  A  Execution(a,s,,s/)  D  Postreqs(a,s/) 

one  gets  the  desired  behavior. 

Example  2.12  In  Section  2.7.1  the  following  axiom  was  given  for  the  PUTON 
using  a  single-term  approach: 

V  a.  b.  t  CLEAR(n.  .s)  A  CLEAR(6.  .s)  O  0N(a,  6,  PUTONfn,  6,  s)). 

To  get  the  same  behavior  for  a  residue  procedure,  the  above  axiom  would  be 
written  in  the  form: 

Va,6,si,s 2  CLEAR(rt,.si)  A 
CLEAR(7>,  S])  A 

EXECUTION(PUTON(n,f>),  si)  A 
PRECEDES(.s|,*2)  D  ON(«,6,.s2). 


lln  this  particular  example,  of  course,  the  order  of  the  execution  really  does  matter. 
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2.7.6  Consistency  Checking 

Consistency  checking  is  an  integral  part  of  the  residue  approach,  but  not  the  single-term 
approach.  Why  the  difference? 

At  first  glance,  the  answer  seems  to  be  that  residues  add  something  to  the  world 
model  W,  whereas  single  terms  do  not.  In  other  words,  the  single-term  approach  is  trying 
to  prove  that 

W  \=  3dG(d), 

where  G  is  the  design  specification,  whereas  a  the  residue  approach  tries  to  add  facts  D 
to  W  such  that 

W  U  D  j=  G.  (7) 

However,  one  need  only  rewrite  (7)  as 

w  t=  (DDG), 

where  D  =  /\  D,  for  all  D,  €  D,  to  see  that  there  must  be  some  other  difference.  After  all, 
the  term  d  might  specify  an  impossible  design  just  as  easily  as  the  formulas  D. 

The  real  reason  for  the  difference  is  the  set  of  search  paths  considered  by  single-term 
approaches.  As  discussed  in  Section  2.7.2,  single-term  approaches  can  build  up  a  design  in 
one  direction  only.  Due  to  the  limitations  of  expression  of  a  term,  one  cannot  build  up  a 
design  from  the  middle  outward,  but  must  either  build  up  the  design  from  the  outputs  to 
the  inputs  (a*  is  commonly  done)  or  perhaps  from  the  inputs  to  the  outputs.16 

Let  us  assume  that  one  is  working  backwards  from  the  goal.17  The  original  goal  is 
completely  regressed  through  each  state  transformation  to  give  new  goals18.  Thus,  at  any 
time,  the  state  transformations  coming  after  (after  in  the  design  itself,  not  in  the  design 
process)  have  already  been  entirely  specified  (modulo  variable  bindings  of  arguments  to  the 
state  tranformation  operators)  —  it  is  impossible  to  choose  which  aspects  of  the  design 
to  consider  in  detail  first.  The  importance  of  making  design  decisions  in  an  acceptable 
order  has  been  recognized  since  the  beginning  of  AI.  For  example,  one  of  the  main  ideas 
of  GPS  [71,23]  was  the  notion  of  solving  the  goal  spanning  the  “greatest  difference”  first. 
One  may  not  do  so  under  the  restrictions  of  the  single-term  approach. 

1  e In  order  to  be  able  to  design  in  other  orders,  one  might  consider  a  hierarchical  approach  as  in  Sacer- 
doti’s  ABSTRIPS  [80].  Given  such  an  approach,  one  can  postpone  decisions  about  the  details  of  parts  of  the 
design.  Unfortunately,  just  like  residue  methods,  a  hierarchical  approach  requires  checking  that  the  parts 
mesh  as  planned,  i.c.,  consistency  checking.  It  was  a  known  limitation  of  ABSTRIPS  that  no  such  checking 
was  done. 

17The  forward  case  is  similar,  and  to  the  best  of  the  author’s  knowledge,  no  such  single-term  system  has 
been  proposed. 

"’See  Waldinger  [97]  or  Nilsson  [72]  for  an  explanation  of  goal  regression. 
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Example  2.13  If  one  has  a  goal  of  flying  from  New  York  to  San  Francisco, 
then  taking  a  trip  by  ship  to  Hong  Kong,  and  driving  to  a  hotel  in  Hong  Kong, 
it  is  hardly  appropriate  to  plan  the  drive  to  the  hotel,  and  then  plan  the  boat 
trip,  and  finally  plan  the  plane  trip.  One  would  almost  certainly  want  to  plan 
the  cruise  first,  followed  by  the  plane  trip,  followed  by  the  drive.  Instead,  a 
unidirectional  search  through  a  set  of  state  transformations  forces  one  to  plan 
the  drive,  then  the  cruise,  and  then  the  flight. 

Single-term  approaches  generally  assume  that  every  composition  of  the  allowed  state 
transformation  operators  is  a  meaningful  design.  Because  such  terms  are  all  that  can  be 
created,  both  during  the  design  process  and  as  the  output  of  the  design  process,  there  is 
no  need  to  check  consistency.  What  is  lost  is  the  ability  to  consider  state  differences  in  any 
order,  to  specify  partial  orderings  on  operators,  to  insert  application  of  operators  between 
existing  operators,  or  to  partially  specify  operators19. 

If  the  above  capabilities  are  not  important,  the  world  model  for  a  residue  system  can  be 
set  up  to  mimic  the  approach  of  the  single-term  approach  (as  explained  in  Section  2.7.5), 
that  is,  one  may  perform  a  strictly  backwards  search  just  as  in  the  single-term  approach. 
Otherwise,  one  faces  the  need  to  design  via  successive  refinements  that  may  or  may  not  be 
consistent. 

2.8  Related  Work 

2.8.1  Reiter’s  Default  Logic 

Ray  Reiter  [74]  develops  what  he  calls  a  “Logic  for  Default  Reasoning”.  His  goal  is  to 
develop  a  logic  for  drawing  plausible  conclusions  that  art'  improvable,  hut  consistent  with 
the  initial  world  model.  For  example,  if  Fred  is  known  to  be  a  bird,  Reiter’s  system  will 
conclude  that  Fred  can  fly  unless  it  can  prove  otherwise.  Such  a  conclusion  is  made  by  using 
a  default,  which  Reiter  expresses  as: 

_  q(x)  :  M/?i(x),  •  ■  • .  M/3m(x) 

*  tn(x) 

The  default  6j  is  interpreted  as,  “If  a(x)  is  true  and  if  (3\  (x),  •  •  • ,  /?„(x)  can  be  consis¬ 
tently  believed,  then  w(x)  may  be  believed. 

19Such  systems  have  also  not  generally  dealt  with  specifications  on  the  form  of  the  design  such  as  “No 
loops  allowed'"  or  “No  more  than  four  NAND-gates  allowed.”  To  handle  such  specifications,  a  single-term 
system  would  be  forced  to  use  some  sort  of  ad  hoc  procedure  to  see  if  the  design  term  meets  this  specification. 
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In  the  example  above,  the  default  was  simply: 


bird(x )  :  M  flies(x ) 
flies(x) 

Reiter  defines  a  default  theory  A  =  ( D ,  W)  to  be  a  set  of  defaults  D  and  a  world  model 
W .  A  default  theory  may  have  extensions  E,  that  is,  smallest  sets  of  well-formed  formulas 
such  that: 

1.  W  C  E, 

2.  E  is  deductively  closed,  and 

3.  All  defaults  that  may  be  added  to  E  are  in  fact  included  in  E. 


In  order  to  decide  whether  a  formula  p  can  be  believed,  Reiter  asks,  “Given  p  and 
A  =  (D,W),  does  there  exists  an  extension  E  such  that  p  G  EV'  Unfortunately  the 
problem  is  in  general  intractable.  There  is  a  subclass  of  the  class  of  general  defaJ'  (that 
is,  defaults  expressed  by  (8))  such  that  there  is  a  proof  theory  for  the  above  question.  This 
subclass  is  called  the  class  of  normal  defaults  and  consists  of  defaults  of  the  form: 

=  ?W;M.,(x) 
w(x) 

Note  that  9  is  a  normal  default.  For  normal  default  theories,  i.e.,  default  theories  such 
that  all  6i  G  D  are  normal  defaults,  Reiter  proves  the  following  results: 


1.  All  normal  default  thcor:°s  h."”0  extensions. 


2.  For  a  formula  p  and  a  normal  default  theory  A,  there  exists  an  extension  E  of  A  such 
that  p  G  E  if  and  only  if  there  exists  a  top  down  default  proof  of  p  with  respect  to  A. 


Top  down  default  proofs  are  Reiter’s  proof  procedure  for  finding  a  subset  Da  C  D  such 
that  for  all  formulas  p,  p  £  E  &  W  U  Consequents!  Dn)  b  p,  where  the  Consequents  of  a 
default  theory  D  is  the  set  consisting  of  the  'U’(x)  for  each  default  <5,  G  D. 


Relation  of  Residue  and  Default  Logic  Both  the  residue  approach  and  Default  Logic 
begin  with  a  world  model  W .  In  addition,  there  is  a  language  of  assumables  A  for  residues 
corresponding  exactly  to  Reiter’s  defaults  D.  In  fact,  each  assumable  can  be  expressed  as: 
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The  meaning  of  (11)  is  that  tn(x)  may  be  assumed  if  it  is  consistent  to  do  so.  It  is  seen 
from  (10)  and  (11)  that  assumables  are  a  proper  subclass  of  the  set  of  normal  defaults, 
namely  the  class  where  the  prerequisite  a(x)  is  always  true. 

The  fact  that  a(x)  =  True  for  all  assumables  means  that  Top  Down  Default  Proofs  re¬ 
duce  to  a  residue  procedure.  For  this  case,  one  need  not  construct  a  sequence  of  derivation 
trees,  but  only  perform  a  single  derivation,  just  as  in  a  residue  procedure.  There  is  a  small 
difference  in  that  Reiter’s  proofs  assume  a  control  strategy  called  Linear  Resolution  (see 
Loveland  [51]  or  Luchkam  [53])  rather  than  set  of  support  resolution.  Linear  Resolution 
seemingly  was  used  only  for  the  sake  of  simplicity.  For  set  of  support  resolution,  Reiter’s 
proofs  go  through  virtually  unchanged.  Although  Linear  Resolution  is  complete,  the  disad¬ 
vantage  of  being  forced  to  use  Linear  Resolution  is  that  it  can  force  the  space  to  be  searched 
in  an  inappropriate  direction.  For  example,  linear  resolution  can  sometimes  force  forward 
chaining,  which  often  is  far  branchier  than  backward  chaining. 

Whereas  a  residue  consists  of  a  set  of  assumables  found  necessary  by  the  proof,  Reiter 
defines  default  support  as  the  set  of  defaults  invoked  in  a  given  top  down  default  proof. 
These  are  in  exact  correspondence. 

The  correspondence  of  designs  to  extensions  is  a  bit  trickier.  If  two  different  designs  are 
inconsistent,  then  they  cannot  both  belong  to  the  same  extension.  On  the  other  hand,  there 
will  in  general  be  numerous  extensions  of  which  a  given  design  is  a  member.  A  design  only 
specifies  part  of  the  world;  what  happens  outside  of  the  design  is  irrelevant  to  the  design, 
but  changes  the  extension.  For  example,  a  plan  might  specify  all  the  actions  in  the  world 
from  time  t  =  0  until  t  =  10,  but  says  nothing  about  events  after  t  =  10.  Every  inconsistent 
course  of  events  after  f  =  10  will  be  in  a  separate  extension. 

Although  residues  are  somewhat  simpler  than  Reiter’s  proof  theory,  the  approach  unfor¬ 
tunately  suffers  from  the  same  computational  complexity  as  Reiter’s  work.  Both  are  based 
on  decision  methods  for  first-order  logic  and  thus  are  at  best  semi-decidable.  However, 
neither  is  even  semi-decidable  because  both  depend  upon  proving  satisfiability  of  the  world 
model  W  unioned  with  the  set  Dn  of  assumptions  made.  As  a  result,  both  Default  Logic 
(as  Reiter  points  out)  and  residue  procedures  must  rely  on  heuristic  methods  to  become 
convinced  of  satisfiability. 


2.8.2  Truth  Maintenance 

A  major  weaknesses  of  the  residue  procedures  presented  so  far  is  that  there  is  1.0  way  to  learn 
from  previous  mistakes.  The  non-deterministic  search  procedure  presented  in  Section  2.3 
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is  neutral  on  the  subject  of  caching  results  of  previous  deductions.20  It  does  not  specify  a 
procedure  for  avoiding  the  pitfalls  of  blind  search  or  of  chronological  backtracking,  but  does 
not  tell  us  we  cannot  add  such  a  procedure. 

In  his  work  on  assumption-based  truth  maintenance  systems  (ATMS)  [21,20,19]  Johan 
de  Kleer  has  listed  a  number  of  known  problems  with  chronological  backtracking.  Let  us 
review  them  here  as  they  relate  to  Residue. 

In  Futile  Backtracking,  one  finds  a  contradiction  and  backs  up  to  a  point  that  could  al¬ 
ready  be  eliminated  as  contradictory  based  on  previously  found  contradictions.  For  example, 
if  {x  —  l,y  —  1}  is  a  contradictory  set,  then  backing  up  from  a  point  {x  =  1,  y  =  1,  z  =  0} 
to  another  point  {x  =  1,  y  =  l,z  =  1}  is  futile.  In  a  residue  procedure,  if  some  set  G  (con¬ 
sisting  of  assumables  and  remaining  goals)  causes  a  contradiction,  i.e.,  W  U  G  |=  false, 
then  it  can  pay  to  note  this  “nogood”  set  (Steele  [91]).  Future  set  of  assumables  that  either 
contain  or  entail  the  set  G  can  be  immediately  eliminated. 

Closely  related  is  rediscovering  contradictions.  In  futile  backtracking  we  did  not  undo 
one  of  the  choices  in  the  newly  discovered  nogood  set.  In  addition,  we  might  find  an  old 
contradiction  via  a  different  path.  As  above,  the  caching  of  nogood  sets  will  solve  this 
problem. 

Rediscovering  inferences  is  a  different  problem.  The  fact  that  a  deduction  was  performed 
on  a  different  search  path  does  not  make  it  inapplicable  on  the  present  path.  As  long  as  the 
deduction  was  not  based  on  a  contradictory  set  of  assumptions,  then  the  deduction  itself 
is  valid  whether  of  not  the  search  path  on  which  it  was  made  was  a  dead  end  or  not.  In 
residue  procedures,  this  is  particularly  important  while  checking  consistency.  There  is  no 
reason  not  to  cache  facts  derived  from  W  and  various  assumables  in  a  global  database  to 
be  used  by  all  search  paths  (the  subject  of  Chapters  3  and  4). 

Incorrect  ordering  is  de  Kleer’s  name  for  finding  contradictions  at  the  right  time.  Of 
course,  there  is  no  general  solution  for  this  problem.  In  residue  procedures,  the  problem 
expresses  itself  as  the  resource  trade-off  between  goal  reduction  and  consistency  checking. 

As  de  Kleer  points  out,  a  Truth  Maintenance  System  (Doyle  [22])  solves  the  first  three 
problems.  Since  Doyle’s  TMS,  there  have  been  various  other  versions  of  TMS’s,  in  particular 
McAllester  [64,62,63],  Martins  [61],  McDermott  [67],  and  de  Kleer  [21,20].  Common  to  each 
of  these  systems  is  storing  of  the  justications  and  assumptions  upon  which  a  deduction  is 
based.  However,  they  differ  in  the  choice  of  information  maintained.  Doyle’s  “justification- 
based”  TMS  stores  justification  pointers  to  its  immediate  predessors  and  successors,  but 
does  not  propagate  new  sets  of  support  for  each  node  at  each  step  of  the  way.  On  the 
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other  hand,  de  Kleer’s  “assumption-based”  TMS  (the  ATMS)  maintains  a  complete  list  of 
“environments”,  that  is,  assumption  sets  upon  which  each  deduction  might  be  based.21  As 
a  result,  one  can  quickly  see  whether  a  node  is  a  member  of  a  given  context  (the  deductive 
closure  of  W  U  G  for  some  set  of  assumptions  G.  However,  insertion  of  a  new  fact  into  such  a 
database  is  expensive,  as  its  effects  on  the  various  environments  must  propagate  throughout 
the  system.  The  justification-based  TMS,  on  the  other  hand,  maintains  a  single  consistent 
context  at  all  times.  As  long  as  contradictions  are  not  found,  insertion  of  new  facts  is  quite 
inexpensive.  However,  when  contradictions  are  found,  the  TMS  may  need  to  do  a  good  deal 
of  search  in  order  to  find  a  new  consistent  context. 

2.8.3  Douglas  Smith 

In  [87]  Douglas  Smith  used  a  natural  deduction  system  for  program  synthesis,  using  a 
modified  single-term  approach  in  extracting  the  answer  from  the  proof  tree.  Rather  than 
prove  that  the  generated  program  was  true  for  all  preconditions,  Smith  attempted  to  reduce 
his  goal  to  a  set  of  preconditions  that  fit  into  one  of  a  number  of  existing  skeletons.  The 
skeletons  were  not  assumable,  but  rather  were  attached  to  procedures  for  modifying  both 
the  remaining  preconditions  and  for  extracting  the  program  from  the  existing  derivation. 
See  [88,89]. 

2.8.4  PROLOG/EX1 

In  Prolog/ExI  [101,100],  Adrian  Walker  modified  Prolog  such  that  in  case  of  failure  to 
produce  an  answer,  the  system  returns  an  explanation  of  what  additional  facts  would  be 
needed  to  produce  an  answer.  The  elements  of  such  an  explanation  of  failure  are  analogous 
to  assumables  in  residue  procedures.  As  such,  Walker  faced  the  problem  of  deciding  what 
sufficient  set  facts  are  most  reasonable  to  assume  to  explain  the  failure.  Not  just  any  set  of 
facts  will  do  —  the  query  itself  obviously  suffices  for  producing  a  proof  of  the  query,  but  is 
hardly  an  acceptable  explanation  of  why  the  proof  failed.  Walker’s  approach  was  to  use  a 
set  of  three  domain-independent  rules  for  deciding  what  proof  steps  to  assume  based  upon 
depth  in  the  proof  tree  and  upon  constants  in  the  query. 

2.8.5  Theorist 

At  the  University  of  Waterloo,  David  Poole,  Randy  Goebel  and  associates  have  recently 
developed  a  system  called  Theorist  [29,73]  for  theory  formation  problems.  As  Poole  et  al 

21  Possible  only  because  de  Kleer  deads  only  with  propositional  calculus. 
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have  pointed  out,  Theorist’s  theories  are  similar  to  residues,  and  they  also  have  pointed  out 
the  similarity  with  Reiter’s  Default  Theories  [74].  There  is  also  a  good  deal  of  similarity 
between  assumability  and  that  of  appropriateness  of  an  explanation  for  diagnosis  and/or 
theory  formation  problems.  Just  as  the  constitution  of  a  legal  design  is  rather  arbitrary, 
the  same  holds  true  for  an  explanation  of  a  problem.  At  some  level,  either  the  system  or 
the  system  designer  must  legislate  what  consititutes  a  sufficient  explanation  that  need  not 
be  further  explained,  just  as  we  legislate  what  design  assumptions  can  be  made  without 
further  explanation.  Luckily,  for  design,  the  problem  of  deciding  what  are  the  primitive 
components  of  a  design  seems  much  easier  than  deciding  what  constitutes  a  primitively 
acceptable  explanation.  See  Charniak  and  McDermott  [16]  and  especially  McDermott  [68] 
for  pessimistic  views  on  finding  such  criteria  for  explanation. 

2.9  Conclusion 

This  chapter  presents  the  residue  approach  to  design  synthesis  and  two  procedures  for 
finding  residues.  By  using  a  set  of  atomic  formulas  to  express  designs  rather  than  using 
a  single  term,  one  gains  two  important  advantages:  (1)  Residues  can  easily  express  much 
finer-grained  decisions  than  systems  based  upon  a  single  term,  and  (2)  Parts  of  designs  can 
be  specified  in  any  order  rather  than  strictly  from  the  goal  backwards  or  from  the  initial 
state  forward.  In  addition,  the  representation  of  partially  complete  designs  as  sets  of  facts 
enables  an  inference  system  using  predicate  calculus  notation  to  reason  directly  about  the 
design.  For  a  given  system  if  the  full  generality  of  the  residue  appraoch  is  not  needed, 
the  database  can  be  written  in  such  a  way  that  the  residue  approach  reduces  to  a  system 
isomorphic  to  the  single- term  approach. 

Residue  procedures  have  been  used  in  a  number  of  projects  at  Stanford  —  DART  [28] 
used  residues  to  generate  diagnostic  tests  for  combinatoric  circuits  and  in  his  PhD  the¬ 
sis,  Narinder  Singh  [83]  used  a  residue  procedure  for  generating  diagnostic  tests  for  IBM 
Printer  Adapter  cards.  Residues  were  also  used  in  Russ  Greiner’s  analogy  understanding 
program  [34]  and  in  Jock  Mackinlay’s  APT  [55,56]  tool  for  automatic  graphical  presenta¬ 
tion. 

The  generality  of  the  residue  approach  contains  the  seeds  of  a  potential  combinatoric 
explosion.  In  particular,  in  the  most  general  case  residue  procedures  require  a  consistency 
check,  which  is  a  non-semidecidable  problem.  Usually,  however,  careful  crafting  of  the 
knowledge  base  prevents  consistency  checking  from  being  prohibitively  expensive. 

Chapters  3  a.nd  4  show  how  the  full  generality  of  the  residue  approach  can  sometimes 
be  exploited  to  bring  about  large  reductions  in  the  search  space. 


Chapter  3 

Supersumption 


3.1  Ramifications  of  a  Goal 

It  is  well-nigh  impossible  to  drive  a  car  without  consuming  fuel.  Similarly,  writing  a  new 
version  of  a  disk  file  will  change  numerous  parameters  associated  with  that  file  such  as  write- 
date,  disk  address,  and  size.  In  both  cases,  the  latter  condition  inevitably  accompanies  the 
former;  the  latter  is  a  ramification  of  the  former. 

Making  the  assumption  that  the  world’s  behavior  (or  at  least  the  relevant  portion  of  it) 
can  be  modelled  by  a  set  of  first-order  axioms  W,  ramifications,  such  as  the  above,  can  be 
captured  by  logical  implication.  In  other  words,  if  W  is  to  capture  fuel  consumption  as  a 
ramification  of  driving,  a  formula  denoting  consumption  of  fuel  must  be  a  logical  implication 
of  any  formula  denoting  driving  of  a  car. 

A  goal  G  is  a  partial  description  of  a  state.  If  some  formula  N1  is  logically  implied 
by  W  U  G,  then  in  every  state  S  described  by  G,  whatever  is  denoted  by  N  also  holds;  N  is 
a  ramification  of  G’s  being  true.  The  process  of  finding  ramifications  of  a  goal  can  be  seen 
as  (1)  looking  at  a  partial  description  of  a  hypothetical  state  S  in  which  G  is  holds  and 
(2)  trying  to  fill  in  more  of  the  description  of  S  based  upon  our  knowledge  of  consistent  or 
“allowed’’  states  of  the  world  (where  this  knowledge  is  captured  in  W). 

Obviously,  it  may  take  an  arbitrary  amount  of  inference  to  find  a  particular  ramification 
N.  Some  facts  follow  immediately  from  G  while  others  may  require  a  complicated  line  of 
reasoning  to  discover.  Of  course,  the  “distance”  between  a  goal  G  and  ramification  N  is  a 
function  of  the  database  and  of  the  inference  engine  at  hand.  Having  a  large  file  in  one’s 
directory  almost  immediately  leads  one  to  realize  that  it  will  be  expensive  to  keep  the  file. 
On  the  other  hand,  it  may  take  a  good  deal  more  inference  to  see  that  the  presence  of  that 

'The  symbol  N  is  being  used  for  a  ramification  instead  of  R  to  avoid  mistaking  the  symbol  for  a  residue. 
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file  precludes  receiving  any  additional  electronic  mail.2 


3.2  Using  Ramifications  of  a  Goal 

There  are  a  number  of  ways  in  which  to  use  the  known  ramifications  of  a  given  goal’s  being 
true.  First  and  foremost  is  elimination  of  an  inconsistent  goal  —  if  a  ramification  of  a  goal’s 
being  true  is  known  to  be  impossible,  then  the  goal  itself  is  not  achievable.  Reduction  of 
problems  to  the  Halting  Problem  is  a  good  example  of  this.  Given  some  goal  G,  if  we  show 
that  achieving  G  implies  that  we  can  solve  the  Halting  Problem,  then  G  is  impossible  to 
achieve.  A  more  mundane  example  would  be  to  try  to  find  a  disk  file  that  is  larger  than  1 
MByte.  If  it  is  known  that  all  files  are  stored  on  256  KByte  floppies,  it  can  immediately  be 
said  that  there  can  be  no  such  file.  On  the  other  hand,  if  the  ramifications  of  the  goal  being 
true  are  ignored,  all  the  files  in  the  system  will  probably  be  enumerated,  checking  the  size 
of  each  to  see  if  it  is  larger  than  1  MByte.  Of  course,  no  such  file  will  be  found. 

Another  use  of  ramifications  is  restriction  of  the  search  space  via  additional  constraints. 
Suppose  again  that  a  file  that  is  larger  than  1  MByte  is  sought.  This  time,  however,  there 
is  a  50  MByte  hard  disk  and  hundreds  of  2-56K  floppy  disks.  From  the  above  information,  it 
can  be  derived  that  a  file  larger  than  1  MByte  must  reside  on  the  hard  disk.  Thus,  files  that 
reside  on  the  floppies  need  not  be  considered,  and  a  good  deal  of  search  has  been  avoided. 
Elimination  of  an  inconsistent  goal  is  a  special  case  of  restriction  via  additional  constraints, 
that  is,  the  case  in  which  the  entire  search  space  is  eliminated  from  consideration  rather 
than  just  part  of  it. 

A  third  use  of  ramifications  is  in  enabling  additional  heuristics  to  be  used.  It  may  be 
the  case  that  whatever  heuristics  used  are  not  directly  applicable  to  the  goal  at  hand,  but 
are  directly  applicable  to  some  ramification  of  the  goal.  Without  filling  out  more  of  the 
description  of  the  goal  state,  the  use  of  potentially  applicable  heuristics  may  be  missed. 
Suppose  we  are  looking  for  an  executable  file  for  playing  chess.  Any  solutions  for  the  goal 
will  also  fall  into  the  category  of  being  executable  files  for  playing  a  game.  If  it  is  known 
that  most  executable  game  files  are  located  on  directory  /usr/games,  then  we  would  be  well- 
advised  to  look  here  first  for  such  a  file.  Note  that  without  realizing  that  chess  is  a  game 
(that  is,  finding  a  ramification  of  the  goal),  we  would  not  have  known  that  the  heuristic 
information  applies  (that  is,  that  most  executable  game  files  reside  on  /usr/games). 


2This  situation  often  arises  in  operating  systems  such  as  TOPS-20  that  impose  a  hard  limit  on  the  amount 
of  dish  space  a  user  is  allowed  to  have. 
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3.3  Subgoals,  Design  Decisions  and  Ramifications 

The  discussion  so  far  has  been  about  ramifications  of  a  given  goal  being  true.  The  discussion 
applies,  however,  not  just  to  the  top  level  goal  of  any  problem,  but  to  any  subgoal3  generated 
in  trying  to  solve  the  problem. 

Example  3.1  Consider  the  goal  G  of  being  in  Denver,  starting  from  a  state  of 
being  in  San  Francisco.  One  way  to  achieve  G  is  via  a  commercial  airline  flight 
from  San  Francisco  to  Denver,  that  is,  a  subgoal  S  to  take  a  flight  from  San 
Francisco  to  Denver.  If  the  cheapest  such  flight  costs  $150,  then  a  ramification 
N  of  S  being  true  is  that  at  least  $150  will  be  spent.  On  the  other  hand,  N  is 
not  a  ramification  of  G  being  true;  there  might  well  be  a  way  to  get  from  San 
Francisco  to  Denver  for  less  money,  say  $100. 

The  above  example  shows  that  ramifications  of  achieving  a  subgoul  are  not  necessarily 
ramifications  of  achieving  the  original  goal.  Achievement  of  a  subgoal  may  have  additional 
ramifications  because  the  subgoal  may  be  more  restrictive  than  the  original  goal  —  there  is 
additional  information  from  which  to  reason.  In  Example  3.1,  in  reducing  G  to  S  there  was 
a  design  decision  made,  namely,  to  take  a  commercial  flight.  Since  S’s  ramification4  that 
the  trip  will  cost  at  least  $150  was  based  upon  this  design  decision,  the  ramification  will 
not  necessarily  hold  for  the  original  goal  G. 

The  process  of  goal  reduction  can  be  viewed  as  a.  process  of  making  design  decisions 
restricting  the  class  of  solutions  to  consider.  Although  most  of  the  previous  examples 
have  been  about  ramifications  of  a  top-level  goal,  anything  that  can  be  said  about  using 
ramifications  of  a  top-level  goal  can  also  be  said  about  using  ramifications  of  a  subgoal. 
All  ramifications  of  a  subgoal  will  not  necessarily  hold  for  the  top-level  goal,  but  since  the 

3There  is  potential  confusion  in  that  the  logic  programming  community  uses  subgoal  to  refer  to  a  conjunct 
of  a  goal  (which  is  a  conjunction  of  literals).  Here,  subgoal  is  used  to  mean  an  entire  goal  (usually  a 
conjunction)  to  which  another  entire  goal  has  been  reduced. 

4The  word  “ramification”  usually  refers  .to  one  thing  being  true  as  part  and  parcel  of  another’s  being 
true.  One  might  speak  of  the  “ramifications  of  a  fact  being  true,”  i.e,  “ramifications  of  achteving  a  goal,” 
and  one  might  also  speak  of  the  “ramifications  of  having  a  given  goal.”  In  the  interest  o{  brevity  let.  us  make 
the  convention  that  “ramification  of  a  goal  G”  refers  to  a  ramification  of  achieving  the  goal  G,  rather  than  a 
ramification  of  having  the  goal  G.  Such  usage  is  consistent  with  viewing  a  goal  G  as  a  partial  description  of  a 
desired  state.  It  is  also  natural  to  speak  of  “ramifications  of  a  design  decision,”  and  again,  what  is  meant  is 
a  ramification  of  implementing  the  design  decision  rather  than  a  ramification  of  making  the  design  decision. 
Since  both  goals  and  design  decisions  are  represented  as  formulas  (See  Chapter  2),  there  is  no  formal  need  to 
distinguish  between  ramifications  of  goals  and  ramifications  of  design  decisions  —  it  is  only  •:  portant  that 
it  be  understood  that  ramifications  of  a  goal  or  design  decisions  refer  t  o  achieving  a  goal  and  ip/crm  ntmg 
the  design  decisions. 
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subgoa]  is  a  sufficient  condition  for  the  top-level  goal,  any  ramification  of  the  top-level  goal 
will  be  a  ramification  of  any  of  its  subgoals.  Later  it  will  be  shown  (Section  4.7)  that  there 
are  cases  in  which  a  subgoal  is  derived  from  more  than  one  higher-level  goal  and  that  a 
higher-level  goal  may  have  ramifications  that  are  not  ramifications  of  the  subgoal. 

3.4  Formal  Definition  of  Ramifications 

The  previous  sections  have  discussed  in  broad  terms  how  ramifications  of  a  goal  being 
true  can  be  useful  in  pruning  the  search  space  of  a  problem.  Let  us  now  give  a  precise 
formulation  of  a  ramification  a  goal’s  being  true.  Although  it  is  tempting  to  simply  say 
that  a  ramification  is  any  formula  N  for  which  W  (=  (G  D  N),  the  existence  of  variables 
requires  a  more  careful  definition. 

Definition  3.1  (Ramification)  Let  W  be  a  satisfiable  set  of  closed  formulas,  and  let  G  be 
a  closed  formula  in  prenex  normal-form,  that  is,  G  =  □iX]  . .  .□;nxmG,  where  □,  is  either  3 
or  V,  and  G  is  a  quanitifier-free  formula  whose  only  free  variables  are  Xi, . . . ,  xg.  Suppose  N 
is  a  formula  whose  only  free  variables  are  y\  ,...,yn  such  that  {yi, . .  ■  ,yn)  Q  {a.’i, . . . ,  xff}. 
We  say  that  N  is  a  ramification  of  G  given  W  (or  Ramification(W,  G,  N  ))  if 

W  |=  Vxi  ...V*fl(G  D  N).  (12) 

In  addition,  we  say  that  N  is  a  strong  ramification  of  G  given  W  (or  StrongRam(W,  G,  N)^ 
if  Ramification(W,  G,  N)  and  W  ^  Vpi...VynN.. 

Note  that  for  every  formula  N  containing  free  variables  zi,...,zq  not  in  G  there  is 
obviously  a  corresponding  formula  N'  =  Vz i  ...zgN.  As  a  result,  the  restriction  that  N’s 
free  variables  be  a  subset  of  G’s  free  variables  causes  no  loss  of  generality.5 

Example  3.2  Consider  a  goal  G  =  3x3?/  A(x)AB(.t,  y)AC(t/)  and  W  containing 
a  formula 

VuVwVu/  (A(u.)  A  C(u))  D  D(u,  v,  w). 

Then  the  open  formula  VwD(r,  y,  w)  is  a  ramification  of  G  given  W. 


'In  Chapter  4,  where  ramifications  will  he  viewed  as  clauses,  it  will  he  necessary  that  uiK|uantific<l  vari¬ 
ables  appear  in  ramifications  that  do  not  appear  in  the  goal  formula.  This  is  only  an  artifact  of  representation 
via  clauses;  flics  variables  are  implicitly  universally  quantified. 
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Example  3.3  Consider  the  goal 

G  =  3xFlies(x)  A  Hairy(x)  A  Loves(x,  Bertha).  (13) 

Suppose  the  database  W  contains  the  formula 

V  y  [Flies(y)  A  Hairy(y)  D  Bat(y)].  (14) 

Since  follows  from  (14)  that 

W  |=  V  x  [Flies(x)  A  Hairy(x)  A  Loves(x,  Bertha)  0  Bat(x)],  (15) 


Bat(x)  is  a  ramification  of  G  given  W. 

The  literal  Loves(x,  Bertha)  is  not  needed  to  show  that  Bat(x)  is  a  ramification 
of  G  and  has  no  effect  on  whether  Bat(x)  is  a  ramification  of  G.  Similarly  we 
might  be  able  to  derive  other  ramifications  dependent  upon  Loves(x,  Bertha), 
but  with  no  dependence  upon  Hairy(x). 

In  the  previous  examples  the  ramification  was  provable  via  application  a  single  appli¬ 
cation  of  modus  ponens  using  a  proposition  from  W.  This  restriction  need  not  hold,  as  in 
the  following  example: 

Example  3.4  Consider  a  goal  formula 

G  =  Travel(Palo  Alto,  Denver)  A  Duration(Palo  Alto,  Denver)  <  4,  (16) 

that  is,  the  task  at  hand  is  to  plan  a  trip  from  Palo  Alto  to  Denver  that  takes 
4  hours  or  less.  Suppose  W  contains  axioms  (17)  -  (20): 


(17) 


V x,  y,  d,  t  Travel(x,  y) 

A  Dist(x,y)  >  d 
A  Duration(x,  y)  <  t 
D  AvgSpeed(x,  y)  >  j, 

that  is,  the  average  speed  required  for  a  trip  is  greater  than  or  equal  to  the 
distance  covered  divided  by  the  maximum  time  allowed  for  the  trip, 


Vx,y,.si,.s2,m 


AvgSpeed(x, y)  =  sx 
A  MaxSpeed(m)  =  s2 
A  si  >  s2 
D  Mode(x.y)  ^  m, 


(18) 


o'vxv-r, 
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that  is,  if  the  average  speed  required  in  getting  from  x  to  y  is  greater  than  the 
maximum  speed  for  a  given  mode  of  travel,  then  that  mode  of  travel  will  not  be 
used  for  the  this  portion  of  the  trip. 

Dist(Palo  Alto,  Denver)  =  1000,  (19) 

that  is,  the  distance  between  Palo  Alto  and  Denver  is  1000  miles,  and 

MaxSpeed(Auto)  =  70,  (20) 

that  is,  the  maximum  speed  for  an  automobile  trip  is  70  miles  per  hour. 

Consider  the  formula  N  =  AvgSpeed(Palo  Alto,  Denver)  =  250.  Since  W  (=  (G  D  N), 
that  is, 

W  |=  [Travel(Palo  Alto,  Denver)  A  Duration(Palo  Alto,  Denver)  <  4] 

D  AvgSpeed( Palo  Alto,  Denver)  =  250, 

N  is  a  ramification  of  G.  Similarly, 

Mode(Palo  Alto,  Denver)  b  Auto  (21) 

is  also  a  ramification  of  G. 


Ramifications  in  the  Residue  Approach  Chapter  2  discussed  design  as  a  problem  of 
finding  a  residue,  that  is,  a  problem  of  reducing  a  goal  G  to  a  subgoal  Dj  A  . . .  A  Dj  such 
that  for  D  =  { Di , . . . ,  Dd} 

1. WUD  [=  G 

2.  Each  D,  is  assumable. 

3.  W  U  D  is  satisfiable. 

Coals  and  designs  are  distinguished  only  by  the  property  of  assumnbility  (See  Section  2.2.3). 
A  legal  design  is  a  goal  that,  happens  to  be  assumable.  Moralise  residue  procedures  express 
design  decisions  as  part  of  a  goal,  there  is  no  need  to  distinguish  between  ramifications  of 
a  goal  being  achieved  and  ramifications  of  making  a  design  decision  —  both  can  be  derived 
front  W  and  from  a  given  goal  (or  subgoal)  generated  by  the  residue  procedure. 
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3.5  Supersumption 

The  key  idea  of  this  chapter  is  supersumption,6  the  reformulation  of  a  goal  by  appending  of 
additional  constraints.  In  this  thesis,  all  such  added  constraint  will  be  ramifications,  but  it 
is  reasonable  to  consider  adding  other  constraints  to  a  goal,  for  example,  constraints  that 
probably  follow  from  the  goal. 

Definition  3.2  (Supersumption)  Given  closed  well-formed  formulas  G  =  Dzi . . . ,  DzgG 
and  G',  where  G  is  quantifier- free,  G'  supersumes  G  (or  “G1  is  a  supersumption  of  G  ”)  if 

G'  =  □ii..,Dij(GAA), 

where  A ’s  only  free  variables  are  {jq ,  - .  • ,  ya}  and  {jq , . . . ,  ya]  C  {zi , . . . ,  xg). 

Definition  3.2  only  requires  logical  equivalence  between  Dzi . . .  □z3(G  A  A)  and  G'.  It 
does  not  require  that  G'  actually  be  the  formula  G  A  A.  In  particular,  if  G  and  A  arc 
conjunctions,  this  definition  says  nothing  about  the  ordering  of  the  conjuncts  in  G'. 

If  A  is  a  ramification  of  G,  then  the  same  set  of  bindings  for  x  cause  G  and  G'  to  be 
entailed  by  W. 

Theorem  3.1  For  a  goal  G  =  Dzi . .  .  ,Dz5G,  world  model  W  and  Ramification(W, G,  N), 
every  model  ofW  and  G  is  a  model  of  G  A  N. 

Proof:  Because  N  is  a  ramification  of  G,  we  know  that 

W  (=  Vzj  ...Vz5(G  A  N).  (22) 

Let  M  be  an  arbitrary  model  of  W  and  G.  Bv  the  deduction  theorem  and  (22), 

M  must  also  be  a  model  for  N,  and  therefore  for  G  A  N.  Thus, 

W  |=  □z,...Dz,(GAN). 


note  that  Theorem  3.1  did  not  preclude  W  containing  a  set  D  of  formula  such  that 
W'  U  D  p  G,  where  W'  =  W  -  D.  In  other  words,  we  have  the  following  corollary: 

6The  motivation  for  the  term  supersumption  is  that  it  is  in  some  sense  opposite  to  subsumption  in  which  a 
disjunction  C  (the  denial  of  a  goal)  is  matched  with  a  known  formula  consisting  of  a  subset  of  C’s  disjuncts. 
Supersuraption,  on  the  other  hand,  adds  conjuncts  to  a  goal;  it  creates  a  goal  G1  the  denial  of  which  is 
subsumed  by  the  denial  of  the  original  goal  G.  If  one  considers  supersumptions  of  non-conjunctive  goals 
then  the  analogy  breaks  down. 


,•1 

A 
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Corollary  3.2  If  D  is  a  residue  of  G  =  □ii...DxsG  given  world  model  W,  and  if 
Ramification(W,  G,  N),  then  D  is  also  a  residue  of  of  Dii . .  .Qzs(G  A  N). 


Example  3.5  Consider  the  goal  of  trying  to  find  an  executable  file  named  chess 
on  some  computer  system: 


G  =  3xFile(:r)  A  Name(x,  chess)  A  Executable^). 


One  possible  supersumption  of  G  would  be  formed  by  adding  an  additional 
constraint  A  =  Directory-of(x,  /usr/games)  to  form  a  new  goal 

G'  =  3x  File(x)  A 

Name(x,  chess)  A  ^ 

Executable(x)  A 
Directory-of(x,  /usr/games). 


If  A  =  Directory-of(x,  /usr/games)  is  a  ramification  of  the  goal,  then  the  new  goal  G',  created 
by  adding  the  conjunct  Directory-of(x, /usr/games)  to  G,  has  the  same  set  of  solutions  as  G. 


3.6  Speedup  Via  Supersumption 


As  discussed  in  Section  3.2,  supersumption  can  reduce  a  search  space  by  (1)  eliminating 
inconsistent  goals,  (2)  restricting  search  space  via  additional  constraints,  and  (3)  allowing 
additional  heuristics  to  be  used.  The  first  and  third  of  these  mechanisms  are  easily  under¬ 
stood.  In  this  section,  the  second  mechanism,  restriction  of  the  search  space,  is  discussed 
in  more  detail. 

The  purpose  of  supersumption  is  to  reformulate  a  goal  G  as  a  new  goal  G'  that  is  cheaper 
to  solve  than  G.  As  in  all  reformulations,  there  is  a  saving  if 


Cost(Reformulation(G))  +  Cost(Solving(G'))  <  Cost(Solving(G)). 


In  other  words,  the  savings  in  finding  solutions  for  G'  must  more  that  offset  the  overhead 
of  reformulating  G.  For  our  purposes,  '‘Solving(G)”  can  refer  to  finding  solutions  by  either 
abduction  or  deduction,  and  can  refer  to  problems  of  finding  one  solution  or  finding  all 
solutions  (assuming  a  finite  number  of  solutions). 

Unfortunately,  estimating  the  cost  of  solving  a  given  problem  is  not  a  well  developed 
area.  In  order  to  make  the  problem  at  all  tractable,  let  us  make  the  following  assumptions: 


— 
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1.  G  and  G'  are  ordered  conjunctive  goals.  A  conjunctive  goal  is  a  closed  formula  g  = 
□xj . . .  Oxg(gi  A . .  .A gn),  where  each  gl  is  an  atomic  formula.  Thus  3.rVyA(x)AB(x,  y) 
is  a  conjunctive  goaf,  whereas  3xVy(A(x)  V  B (x,y))  D  D(x,y)  is  not.  Just  as  clauses 
in  resolution  can  be  seen  as  being  either  sets  or  sequences  of  literals,  so  can  conjunc¬ 
tions.  An  ordered  conjunctive  goal  is  a  sequence  of  atomic  formulas.  In  an  ordered 
conjunctive  goal  it  is  meaningful  to  refer  to  the  first  (leftmost)  conjunct  and  to  speak 
of  stepping  though  the  conjuncts  one  at  a  time  from  the  first  to  the  last  (or  rightmost) 
conjunct. 

2.  Conjuncts  will  be  solved  in  order  starting  at  the  left,  that  is,  the  leftmost  conjunct 
is  solved  independent  of  the  remaining  conjuncts,  and  any  constraints  needed  for  the 
solution  to  the  leftmost  conjunct  (variable  bindings  or  assumptions  made)  are  imposed 
upon  the  remaining  conjuncts,  and  the  process  begun  anew. 

For  problems  for  which  the  above  two  assumptions  hold,  the  supersuinption  process  is 
illustrated  by  Figure  7.  For  such  goals  G,  one  first  creates  a  new  goal  G  A  N  and  then  orders 
the  conjuncts,  yielding  an  alternative  goal  G'. 

Chapter  4  deals  with  finding  ramifications  for  a  given  conjunctive  goal  G  and  world 
model  W.  The  subject  of  ordering  conjunctive  goals  has  received  attention  in  both  the 
database  and  AI  literatures.  The  approaches  that  have  generally  made  the  following  as¬ 
sumptions: 

3.  Al!  solutions  {yi  — *■  4) . y3  —  Yj)  to  W  |=  G  are  sought,  where  {t/j  ...,?/,}  is 

a  subset  the  set  of  the  existentially  quantified  variables  in  G  and  the  Y\  are  ground 
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instances  of  the  y{  J 

4.  For  every  conjunct  R(Ti, . .  .,Tn)  appearing  in  either  G  or  in  G',  R  is  either  evaluable 
or  extensional,  where  R  is  a  relation  symbol  and  Tx  is  a  term.  An  evaluable  relation 
is  such  that  the  truth  value  of  any  ground  instance  of  an  atomic  formula  containing 
this  relation  can  be  ascertained  in  constant  time.  Examples  of  evaluable  relations  on 
integers  are  less-than,  oddp,  or  positivep.  Extensional  relations  are  such  that  all  known 
ground  instances  of  each  conjunct  containing  this  relation  appear  directly  in  W;  no 
additional  ground  instances  of  the  conjunct  are  entailed  by  W.  In  short,  each  conjunct 
appearing  in  G  or  G'  may  simply  be  looked  up  in  W  to  find  all  of  its  known  ground 
instances. 

As  stated  above,  the  subject  of  conjunct  ordering  has  been  studied  in  under  most  or 
all  of  the  above  assumptions.  Authors  such  as  Blasgen  and  Eswaren  [3],  King  [44]  and 
Cliakravarthy  [13]  make  Assumptions  1-4  in  their  work,  as  is  common  in  the  database 
literature.  David  E.  Smith  [86,85]  adds  another  assumption,  that  every  extensional  relation 
is  indexed  on  each  of  its  arguments. 

This  research  assumes  that  there  exists  the  means  to  order  conjuncts  reasonably.  As¬ 
sumptions  1  and  2  are  made,  but  Assumptions  3-5  will  be  explicitly  stated  if  they  are  being 
assumed. 


3.6.1  Generators  and  Filters 

It  will  be  useful  to  distinguish  two  ways  that  a  conjunct  can  act  with  respect  to  a  variable, 
as  a  generator  or  as  a  filter  of  its  values.  If  the  ordered  conjunction  is  to  be  solved  in  order, 
then  for  every  variable  x  in  every  conjunct  C,  it  can  be  said  whether  x  will  be  grounded  or 
not  when  solutions  are  to  be  found  for  C.  The  first  (leftmost)  appearance  of  each  variable 
will  not  be  grounded  and  the  rest  grounded.  Thus,  the  first  conjunct  C  in  which  variable  x 
appears  generates  values  for  x;  we  say  that  C  is  a  generator^  "or  x.  For  any  subsequent, 
conjuncts  D  in  which  x  appears  there  will  be  solutions  for  some  subset  of  the  values  of  x 

'  Note  that  this  assumption  disallows  indefinite,  i.e.,  disjunctive  solutions.  See  Reiter  [75]  for  a  discussion 
of  indefinite  solutions. 

Note  also  that  this  assumption  is  not  very  strong  in  that  the  cost  of  finding  one  solution  can  usually  be 
reasonably  approximated  as  the  time  of  finding  all  solutions  divided  by  the  number  of  solutions. 

Tn  programming,  given  a  problem  (or  subproblem)  with  many  solutions  Si,  Si,...,  a  ycnerator  is  the 
name  commonly  given  to  a  procedure  that  returns  one  solution  each  time  it  is  called,  and  if  the  S,  are 
exhausted,  returns  a  token  saying  that  it  can  produce  no  more  solutions.  Generators  are  commonly  in. 
(demented  via  coroutines,  Algol  own  variables,  reference  to  global  data  structures,  or  some  other  form  of 
memory  between  calls. 
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Figure  8:  Speedup  Obtained  Using  Additional  Constraint  as  Generator 


generated  by  C]  we  say  that  D  is  a  filter  lot  x.  Note  that  a  conjunct  can  be  a  generator  for 
some  variables  and  a  filter  for  others. 


Example  3.6  Given  the  ordered  conjunctive  goal  C(x)  A  D(x,  y)  A  E(x,  y),  C(x) 
acts  as  a  generator  for  x ,  D(x,  y)  acts  as  a  filter  for  x  and  a  generator  for  y,  and 
E(x,y)  acts  as  a  filter  for  both  x  and  y. 


3.6.2  Ramifications  as  Generators 

One  of  the  ways  in  which  a  supersumption  G'  =  G  A  A  can  be  cheaper  to  solve  than  G  itself 
is  for  the  additional  constraint  A  to  act  as  a  generator  of  values  for  some  vaiiable  x. 

Let  us  consider  a  very  simple  case  in  which  the  cost  of  finding  all  solutions  to  a  conjunct  C 
is  equal  to  the  number  of  ground  instances  of  C  appearing  in  W.  The  above  assumption 
corresponds  roughly  to  a.  database  lookup  on  a  fully  indexed  extensional  relation. 


Example  3.7  Consider  a  database  W  of  information  about  the  files  of  a  com¬ 
puter  system.  Suppose  that  all  solutions  to  each  of  the  atomic  formulas  Executable!/), 


3.6.  SPEEDUP  VIA  SUPERSUMPTION 


Compiler-File(/),  and  Directory-of(/,  d )  can  be  found  with  cost  equal  to  the  num¬ 
ber  of  solutions.  Here,  Executable  is  a  unary  relation  holding  for  all  executable 
files,  Compiler-File  is  a  unary  relation  that  holds  for  all  files  of  source  code,  bi¬ 
nary  code  or  documentation  for  compilers  of  all  programming  languages,  and 
Directory-of  is  a  binary  relation  such  that  the  second  argument  is  the  name  of 
the  directory  on  which  the  file  named  by  the  first  argument  is  found.  Only  if 
d  is  a  ground  term  is  it  assumed  that  the  cost  of  generating  all  solutions  to  a 
query  Directory-of(/,  d)  is  equal  to  the  number  of  solutions. 

Given  the  above,  suppose  we  have  the  query 

G  =  3 /Executable(/)  A  Compiler-File(/), 


that  is,  a  request  has  been  made  to  list  all  executable  compiler  files.  Assuming 
the  number  of  executable  files  is  less  than  the  number  of  compiler  files,  the  goal 
G  is  already  optimally  ordered.  G  uses  Executable(/)  as  the  generator  of  values 
for  /  and  filters  this  set  of  values  with  Compiler-File(/).  The  cost  of  finding  all 
solutions  to  G  would  be 


Cost(Solving(G)) 


and 


Card(Executable(/))  + 
Card(Executable(/)  fl  Compiler- File(/)), 


Card(Executable(/))  <  Cost(Solving(G))  <2*  Card(Executable(/)), 


where  Card(s)  is  the  cardinality  of  the  set  s. 

Now,  suppose  that  it  is  known  that  all  executable  compiler  files  are  on  directory 
/bin/compilers.  For  example,  W  might  contain  a  formula 

Vy  Executable(y)  A  Compiler-File(y)  3  Directory-of(y, /bin/compilers). 

In  such  a  case,  N  =  Directory-of(/,  /bin/compilers)  would  be  a  ramification  of  G, 
yielding  one  possible  supersumption 


G1  =  3/Directory-of(/, /bin/compilers)  A  Executable(/)  A  Compiler-File(/). 


The  cost  of  finding  all  solutions  to  G'  is 
Cost(SolvingfG'))  = 

Card(Directory-of(/,  /bin/compilers))  + 

Card(Directory-of(/, /bin/compilers)  fl  Executable(/))  + 
Card(Directory-of(/,  /bin/compiiers)  fl  Executab!e(/)  fl  Compiler-File(/)). 
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Assuming  that 

Card(Directory-of(x, /bin/compilers))  <C  Card(Executable(x)) 
it  can  be  seen  that 

Cost(Solving(G/))  <  Card(Directory-of(/, /bin/compilers))  -f 
2  *  Card(Executable(/)) 

<  Cost(Solving(G)). 

The  above  example  is  illustrated  in  Figure  8. 

In  the  above  example,  it  is  assumed  that  the  cost  of  solving  a  conjunct  is  equal  to 
the  number  of  solutions.  Such  an  assumption  essentially  means  that  there  is  no  search 
for  solutions  —  they  are  simply  enumerated.  In  situations  where  more  search  is  involved, 
supersumption  can  be  a  more  powerful  tool. 


Example  3.8  Consider  the  goal 

G  =  3xFilename(x,  Chess)  A  Executable(x), 

that  is,  find  the  set  of  all  executable  files  named  chess.  This  time,  let  us  assume 
that  both  Filename(x,  Chess)  and  Executable(x)  can  be  solved  with  cost  propor¬ 
tional  to  the  number  of  files  in  the  entire  computer  system,  typically  O(I05). 
Ground  instances  of  both  of  these  conjuncts  can  be  solved  with  unit  cost.  The 
above  cost  assumptions  are  a  reasonable  model  of  most  computer  systems.  A 
list  of  all  files  with  a  given  name  is  not  usually  directly  available  on  most  sys¬ 
tems.  To  get  such  a  list  one  must  somehow  enumerate  all  the  files  on  the  system 
and  then  filter  this  list  to  find  those  with  a  given  name.  Thus,  assuming  that 
one  knows  how  to  enumerate  all  the  files  of  the  entire  system  (without  further 
search  for  such  a  procedure),  one  can  find  the  set  of  files  named  Chess  in  time 
proportional  to  the  total  number  of  files. 

Given  such  a  goal,  a  human  user  would  typically  either  find  a  short  cut  based 
on  other  available  information,  or  else  resign  himself  to  a  large  search.  Let  us 
see  how  supersumption  can  use  such  additional  information  in  order  to  reduce 
the  search. 
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Suppose  now  that  it  is  known  that 

Game(Chess) 

and  that 

VyGame(y)  A  Executable^)  D  Directory-of(y, /usr/games), 

that  is,  all  executable  files  for  games  are  located  on  directory  /usr/games.  In 
such  a  case,  N  =  Directory-of(x, /usr/games)  is  a  ramification  of  G,  and  the  goal 

G'  =  3xDirectory-of(x,  /usr/games)  A  Filename(x,  Chess)  A  Executable(x) 

is  the  obvious  reordering  of  3x(GaN),  where  G  =  Filename(x,  Chess)AExecutable(x). 

In  constrast  to  the  other  two  conjuncts,  Directory-of(x, /usr/games)  asks  for  in¬ 
formation  that  is  directly  available  to  a  typical  system,  that  is,  the  solutions  to 
it  can  be  enumerated  with  cost  proportional  to  the  number  of  solutions  to  the 
conjunct.  Typically,  a  directory  will  contain  0(1O2)  files,  and  thus 

Cost(Solving(G/))  «  102  +  1+1  «  102. 


A  speedup  of  three  orders  of  magnitude  seems  quite  large,  but  for  such  a  problem  it  is  not 
unrealistic  —  an  unreasonable  goal  was  given  to  the  system.  Using  available  information, 
one  is  able  to  reformulate  such  a  goal  to  yield  much  more  reasonable  one.  If  all  goals  given 
to  a  system  were  formulated  in  the  optimal  fashion,  there  would  be  no  need  for  many  sorts 
of  optimizations.  Indeed  systems  would  have  to  be  far  less  robust.  But,  it  is  for  precisely 
such  badly  stated  goals  that  techniques  such  as  supersumption  are.  needed  —  to  allow  a 
problem-solving  system  to  handle  a  wider  range  of  goals  with  acceptable  speed. 

The  above  examples  contained  only  a  single  variable.  In  addition,  the  additional  constraint 
was  added  in  front  of  the  first  conjunct  of  the  original  query  G.  In  general,  however, 
there  may  be  many  variables,  and  the  additional  constraint  A  need  not  appear  as  the  first 
conjunction  in  order  to  act  as  a  generator  for  some  variable  Xj. 

3.6.3  Additional  Restrictions  on  Arguments 

A  related  way  in  which  ramifications  can  Itelp  in  generation  is  by  restriction  of  additional 
variables  on  a  given  database  lookup. 
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Example  3.9  Suppose  W  contains  ground  formulas  of  the  form 

Teaches((lnstructor),  (Subject),  (Hour),  (room)),  (26) 

and  the  following  query  is  posed: 

G  =  3i3rTeaches(i,  Math,  lpm,  r).  (27) 

If  it  is  known  (or  can  be  derived)  that  Fred  is  the  only  teacher  of  afternoon  math 
courses,  i.e., 

V  i,  h  Teaches(i,  Math,  A,  r)  A  Afternoon(A)  D  i  —  Fred  (28) 

then  one  instead  can  look  for  tuples  of  the  form 

G'  =  3rTeaches(Fred,  Math,  lpm,  r).  (29) 

If  the  Teaches  relation  is  indexed  on  its  first  argument,  the  constraint  that 
i  =  Fred  can  result  in  a  large  speedup  in  enumerating  solutions.  If  the  first 
argument  is  the  only  argument  indexed,  then  G  requires  scanning  all  tuples  of 
the  Teaches  relation.  G',  on  the  other  hand,  requires  scanning  only  those  tuples 
with  Fred  in  the  first  position. 

Even  if  other  arguments  of  the  Teaches  relation  are  indexed,  G'  can  still  be 
cheaper  than  G.  For  example,  if  the  second  argument  is  indexed,  one  would 
have  to  scan  all  Math  classes  to  solve  G,  a  set  that  would  typically  be  much 
larger  than  the  set  scanned  in  solving  G*.  the  set  of  classes  that  Fred  teaches. 

Query  improvement  via  restrictions  on  variables  is  discussed  thoroughly  in  King  [44]  and 
by  17 .  S.  C-hakravarthy  [12,14,13].  Note  also  that  equality  is  not  the  only  useful  restriction 
that  can  be  found.  If  the  set  of  tuples  for  a  given  relation  are  sorted  according  to  some 
ordering  of  one  of  its  arguments  (a.  “sorted  index”),  then  finding  some  restriction  on  the 
range  of  values  for  this  argument  can  also  allow  for  a  speedup.  Again,  see  King  [44]  for 
many  such  examples. 

3.6.4  Ramifications  as  Filters 

Besides  helping  to  generate  fewer  possibilties  to  test,  knowing  ramifications  can  reduce  the 
expense  of  finding  answers  in  by  filtering  partial  answers.  Consider  a  goal 
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In  order  to  find  the  answers  to  such  a  goal,  the  set  of  x’s  for  which  A(x)  holds  is  generated 
and  then  for  each  of  these  x’s,  the  set  of  y’k  for  which  B(x,y )  holds  is  generated. 

Suppose  now  that  some  ramification  N  =  N(x)  is  derived  for  G.  Consider  the  modified 
(and  reordered)  goal 

G'  =  3x3j/A(x)  A  N(x)  A  B(x,  y). 

If  N(x)  reduces  the  number  of  x’s  to  consider,  then  one  can  avoid  generating  the  set 
of  possible  j/’s  for  each  of  the  x’s  eliminated  from  consideration.  Note,  of  course,  that  the 
proper  reordering  of  the  conjuncts  A(x),  B(x,y),  and  N(x)  is  necessary  to  take  advantage 
of  any  possible  speedup. 

Restating  the  above,  let  a  is  the  number  of  x’s  for  which  A(x)  holds  and  6  be  the  average 
cost  of  finding  all  y' s  for  which  B(X,  y)  holds,  where  X  is  an  arbitrary  ground  term.  Then, 
without  using  N(x)  as  a  filter,  O(ab)  pairs  of  values  must  be  considered  for  the  ordered 
conjunction  3x3j/A(x)  A  B (x,y),  that  is 

Cost(Solving(G))  =  Cost(Solving(A(x)))  +  ab.  (30) 

Assume  now  that  N(x)  holds  only  for  some  fraction  £  of  the  a  answers  for  A(x).  Then,  for 
ordered  conjunct  G'  =  A(x)  A  N(  x)  A  B(x,  y), 

Cost  (Solving(  G' ) )  — 

Cost(Solving(A(x)))  + 
aCost(Solving(N(X))  + 


In  other  words,  only  £  of  the  pairs  (x,y)  that  were  considered  for  G  need  be  considered  for 
G'.  By  filtering  one  generated  set  first,  the  size  of  its  cross  product  with  another  set  iias 
been  reduced. 


Example  3.10  Suppose  we  are  looking  for  the  set  of  radical  staff  members  of 
presidents,  that  is, 

G  =  3x3yPresident(x)  A  OnTheStafff.r .  y)  A  Radical(.iy).  (31) 

In  order  to  find  the  pairs  (.r,  y)  satisfying  this  formula,  we  would  have  to  generate 
the  10  .r's  for  which  President(.c )  hold,  and  then  for  each  of  these  10  presidents 
we  must  generate  the  say  A  people  for  whom  OnTheStafffx,  y )  holds,  for  some 
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y.  Finally,  Radical(y)  is  looked  up  on  (x,y)  pairs  already  generated.  It  generates 
no  new  pairs  (although  it  may  eliminate  some  old  pairs),  so  in  total,  40 K  pairs 
of  values  will  be  considered. 


Lookup 


Number  of  Values  Generated 


President(x) 
OnTheStafF(x,  y) 
Radical(  ?/) 


40 

K 

1 


(32) 


Product 


40  K 


Now,  suppose  W  contains  the  proposition  “Only  a  Democrat  would  have  a 
radical  on  his  staff,”  that  is, 

V  u,  v  [  Radical(w)  A  OnTheStaff(u,  v)  D  Democrat(u)  ] .  (33) 

Since  Democratic)  is  a  ramification  of  G,  consider  the  new  ordered  query 

G'  =  3.r3yPresident(.r)  A  Democrat(x)  A  OnTheStafF(x,  y)  A  Radical(y).  (34) 

It  is  true  that  all  40  presidents  must  still  be  generated,  but  only  6  of  the  40 
presidents  have  been  Democrats.9  Thus,  the  ramification  Democrat! x)  filtered 
the  set  of  40  presidents  to  only  6  Democratic  presidents.  By  doing  the  filtering 
before  generating  the.  cross  product  of  x  and  y,  we  generate  only  OK  rather  than 
40 A'  pairs  as  shown  in  Figure  35. 


Lookup 


Number  of  Values  Generated 


Presidentf  x ) 


10 


!9 
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Figure  9:  Using  a.  Ramification  as  a  Filter 

Note  also  that  the  ramification  Democrat(x)  would  be  a  terrible  generator  in 
this  example.  Since  there  are  many,  many  more  than  40  Democrats,  trying  to 
use  the  ramification  Democrat(x)  as  a  generator  would  have  generated  this  huge 
set  before  using  President(x)  to  reduce  the  set  to  only  6  Democratic  presidents. 

3.7  Summary 

This  chapter  has  defined  ramifications ,  conditions  that  must  accompany  the  achievement 
of  a  goal,  and  supersumption ,  the  reformulation  of  a  goal  to  include  additional  constraints 
such  as  ramifications.  One  purpose  of  finding  and  using  ramifications  is  to  widen  the  set 
of  heuristics  known  to  Ire  applicable  to  a  goal.  Viewing  goals  as  being  partial  descriptions 
of  states,  ramifications  help  complete  the  state’s  description  thereby  making  a  potentially 
iarger  set  of  heuristics  directly  applicable.  Alternatively,  ramifications,  acting  either  as 
filters  or  generators  of  goal  reductions,  can  reduce  the  size  of  the  search  space  in  which 
solutions  might  be  found.  As  a  filter  a  ramification  is  a  formula  that  can  be  evaluated 
for  a  given  goal,  and  if  false,  the  goal  can  be  immediately  pruned.  As  a  generator,  the 
ramification  is  used  as  a  source  of  potential  goal  reductions;  if  it  is  significantly  cheaper 
to  generate  solutions  via  the  ramification  than  via  the  original  goal,  an  overall  savings  can 
result. 

I  here  has  been  a  good  deal  of  work  related  to  supersumption,  finding  ramifications, 
and  the  mechanisms  by  which  supersumptions  of  a  goal  can  be  cheaper  to  solve  than  the 
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Chapter  4 


Finding  Ramifications 


4.1  Introduction 

Chapter  3  defined  ramifications  and  showed  how  they  can  reduce  the  size  of  a  search  space. 
This  chapter  considers  the  problem  of  generating  ramifications,  that  is, 

Given  a  database  W  of  facts  and  a  goal  G,  what  procedure  V  will  enumerate 
formulas  N  such  that 

W  |=  Vx,  . .  .Vx5  (G  D  N),  (36) 

where  G  is  assumed  to  be  in  prenex-conjunctive  normal  form1  with  matrix  G  a 
prefix  Oxi  . . .  Dxg  and  such  that  yn  C  (xi, . . . , x5),  where  y/v  is  the  set  of  free 
variables  of  N . 

For  any  such  procedure  V ,  we  must  consider  a  number  of  factors.  First,  how  efficient 
is  VI  Second,  is  V  sound  and  complete ,  that  is,  for  a  given  W  and  G,  will  P  eventually 
generate  all  N  for  which  (36)  holds  and  none  others?^  Third,  does  V  lend  itself  to  caching 
its  results  for  use  in  finding  ramifications  on  other  goals? 

Four  procedures  will  be  considered  for  generating  ramifications: 

1.  Lexical  Generation  of  Ramifications  ( Vi.rx ), 

2.  Natural  Deduction  on  Subgoals  (’Pyai), 

3.  Resolution  on  Goal  Clauses  {V/iac),  and 

1  Prenex-disjiitu  live  normal  form  is  also  acceptable.  See  Manna  [Vf]  loi  presentation  of  prom  x  <  on  pim  fix  e 
and  prefix-disjunctive  normal  forms.  The  important  result  is  that,  every  sentence  of  (nst-oidr  r  predicate 
calculus  can  be  converted  to  a  logically  equivalent  sentence  in  prenex-conjniu  tive  (ui  in  prenex-di-jiinefive) 
normal  form  in  time  proportional  to  lire  length  of  the  formula. 

‘  Actually,  a  slightly  weaker  notion  (defined  i:i  Section  •1.S)  of  completeness  suffices  and  will  In  used  hens 
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4.  Resolution  with  Partial  Subsumption  ( Vrps )• 

The  first  two  methods,  VLex  and  V^au  are  sound  and  complete,  but  terribly  inefficient; 
they  are  briefly  presented  to  show  that  soundness  and  completeness  are  easily  achieved,  but 
in  and  of  themselves,  these  procedures  are  of  little  interest.  The  remaining  two  methods 
are  based  on  binary  resolution  [76]  under  the  set  of  support  restriction  [105].  The  use  of 
resolution  in  these  two  procedures  differs  from  the  usual  use  of  resolution.  Traditionally, 
resolution  has  been  used  as  a  refutation  technique,  that  its,  a  technique  for  showing  that  a  set 
of  sentences  is  unsatisfiable.  Vrgs  and  Vrps  use  resolution  as  a  deduction  technique,  that 
is,  a  way  to  find  logical  implications  of  a  given  sentence  (Section  4.5  contains  a  discussion 
on  the  motivation  for  such  usage  of  resolution.)  Vrgc  is  a  straightforward  use  of  resolution 
as  a  forward  inference  technique.  Soundness  and  completeness  results  are  proven  for  Vrgc 
in  Section  4.5.  Vrps  is  a  less  obvious  and  superior  way  in  which  to  find  ramifications. 
Vrps  derives  ramifications  via  resolutions  on  the  world  model  W,  but  restricts  the  allowed 
resolutions  via  a  restriction  defined  in  terms  of  the  goal  G.  Vrps  allows  different  goals  to 
share  their  searches  for  ramifications,  and  even  without  such  caching  is  identical  in  efficiency 
to  Vrgc- 

Complexity  of  Finding  Ramifications  For  arbitrary  first-order  formulas,  the  complex¬ 
ity  of  finding  ramifications  is  bound  by  the  limitation  that  the  set  of  ramifications  of  a  goal 
is  recursively  enumerable,  but  not  recursive.  A  procedure  such  as  VLex  is  such  an  enumer¬ 
ation  procedure,  but  there  is  no  procedure  to  decide  in  finite  time  whether  an  arbitrary 
formula  is  or  is  not  a  ramification  of  a  given  goal.  It  is  easy  to  see  this:  If  there  were  such  a 
algorithm,  one  could  decide  whether  a  given  formula  N  is  a  ramification  of  the  goal  “true”, 
that  is,  whether  or  not 

W  j=  N, 

a  known  impossibility.  Thus,  one  can  enumerate  all  ramifications,  and  one  can  determine 
in  a  finite  (though  unbounded)  amount  of  time  that  a  given  formula  is  a  ramification  of  a 
given  goal  and  world  model,  but  one  cannot  in  general  determine  in  a  finite  amount  of  time 
that  a  given  formula  is  not  a  ramification  of  a  given  goal  and  world  model. 


4.2  Lexical  Generation  of  Formulas  (Vux) 

One  sound  and  complete  but.  very  inefficient  method  of  generating  all  ramifications  of  a  goal 
is  to  systematically  generate  every  possible  well-formed  formula  ( wff ),  checking  each  one  to 
see  if  it  is  a  ramification  of  the  goal  G.  Procedure  pLer  is  an  outline  of  such  a  procedure. 


i - 1  .**  |  |  , 


4.3.  NATURAL  DEDUCTION  ON  SUBGOALS  (VNAT) 


Procedure  Vux 

c  <—  0  (*  number  of  candidate  wffs  so  far  *) 
for  /  <—  1  to  oo  do  (*  length  of  wff  *) 

Fi  *—  {  f  |  /  is  a  wff  of  length  /  and  contains  free  variables  xo  C  x  } 
for  each  /  €  Fi  do 

Bc  -  W  U  CNF(  -(Van  •  •  .Vzs  (G  3  /))  ) 
c  « —  c-f- 1 

for  k  <—  0  to  c  do 

r  result  of  a  resolution  step  on  Bk  (if  possible) 
if  r  =  NULL  then  output(/t) 


It  is  easy  to  see  that  Viex  is  complete.  There  are  a  finite  number  (modulo  variable 
names)  of  wffs  of  a  given  length  and  quite  simple  to  find  an  algorithm  for  generating  them 
all  in  a  finite  length  of  time.  Such  an  algorithm  is  assumed  in  Step  3.  Steps  4-6  set  up 
the  base  set  for  checking  (via  resolution)  whether  a  given  wff  fc  is  a  ramification  of  G. 
Finally,  Steps  7-9  performs  a  single  resolution  step  on  every  candidate  wff  f  generated  so 
far,  outputting  any  /  for  which 

W  )=  Vii  . .  .V.t3  (G  D /).  (37) 

Note  that  since  a  procedure  checking  a  given  /  will  not  necessarily  halt  there  must  be 
interleaving  of  the  checking  of  formulas  of  a  given  length  with  generation  and  checking  of 
formulas  of  greater  length.  Procedure  Vux  guarantees  that  every  candidate  wff  f  eventually 
receives  an  unbounded  number  of  resolution  steps  in  attempting  to  prove  (37).  Thus  it  is 
guaranteed  that  for  a  given  /,  a  proof  of  (37)  will  eventually  be  found  if  one  exists. 

Procedure  V[iCX,  while  sound  and  complete,  is  a  very  poor  method  of  generating  ram¬ 
ifications.  Its  failing  is  that  it  docs  not  use  the  problem  at  hand  in  guiding  its  search  for 
ramifications,  nor  does  the  procedure  seem  easily  ammenaide  to  such  guidance.  In  order  to 
make  t  he  procedure  be  responsive  to  the  problem,  the  subgoal  and  known  facts  themselves 
must  be  used  to  guide  the  search  for  possible  ramifications. 

4.3  Natural  Deduction  on  Subgoals  ('P,v«0 

Probably  I  lie  best,  known  examples  of  deductively  complete5  systems  for  first-order  predicate 

calculus  are  various  natural  deduction  schemes,  for  example,  the  Gentzen  system  presented 

1  Deduct i vc  completeness  is  to  he  carefully  distinguished  from  njutulion  completeness.  A  system  is  dc- 
1 1 actively  complete  if  (or  any  proposition  entailed  by  a  set  of  propositions,  there  exists  a  deduction  of  that 
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in  Manna  [57],  page  108.  Let  us  suppose  a  world  model  W  consisting  of  a  finite  satisfiable 
set  of  closed  well-formed  formulas  of  first-order  predicate  calculus.  Let  G  be  a  closed  formula 
in  prenex-conjunctive  normal  form  with  matrix  G  and  prefix  dzi  . . .  OXg. 

The  procedure  V^at  follows.  Note  that  Step  2  is  only  partially  specified.  Even  though 
the  natural  deduction  system  is  complete,  nothing  has  been  stated  about  controlling  the 
natural  deduction  steps.  To  insure  that  every  possible  ramification  is  eventually  deduced, 
there  must  also  be  a  control  strategy  guaranteeing  that  every  possible  proof  will  eventually 
be  tried. 

The  Vsat  Procedure: 

1.  Replace  Variables  xj, . . .  ,xg  of  G  via  substitution  a  with  a  set  of  new  and  distinct 
constants  X\, . . . ,  Xg. 

2.  Apply  Natural  Deduction  Rules  to  W  U  {Gct} 

3.  Backsubstitute:  For  any  formula  N'  deduced,  return  N  =  Vt/i  . .  Vj/n(NV_1 ),  where 
{ 2/1 ,  •  -  - ,  yn}  is  the  set  of  free  variables  in  N'er-1 ,4 

The  replacement  of  the  variables  X\ . . . ,  xg  by  new  constants  is  needed  so  that,  the  natural 
deduction  will  not  deduce  “ramifications”  that  are  not  true  for  all  values  of  the  variables  of 
G  in  the  formulas  deduced.  Since  these  constants  are  arbitrary,  any  formula  N  deduced  is 
entailed  for  all  values  of  x\, . . .,  xg.  By  soundness  of  the  natural  deduction  system,  for  any 
formula  N'  derived, 

W,Gct  (=  N 

and  by  the  Deduction  Theorem, 

W  j=  (Ga  D  N). 

The  inverse  substitution  a~x  can  be  applied  to  nz  to  regain  the  original  fr»*e  variables  x\ . . . . ,  xa. 

Pt\ai  is  a  complete  method  for  finding  ramifications  of  a  subgoal.  It  is  far  better  than 
Viei  in  that  it  is  guided  by  the  database  W  at  hand.  However,  it  is  still  lacking  an  essential 
feature  -  it  is  not  sensitive  to  the  actual  goal  at  hand,  that  is,  it  will  just  as  readily  find 
implications  of  two  random  facts  in  the  database  as  it  will  combine  known  facts  with  the 
goal  in  order  to  produce  ramifications  that  depend  upon  the  goal.  But,  The  ramifications 
of  most  interest  are  those  which  depend  upon  the  goal. 

proposition.  A  system  is  refutation  complete  if  “false1-  can  be  deduced  from  every  unsatisfiable  set  of  formu¬ 
las.  Resolution  is  refutation  complete,  but  not  deductively  complete  For  example,  given  a  set  of  formulas 
{a,  -•a  V  6},  resolution  cannot  derive  the  formula  6  V  c  in  spile  of  the  fact  that.  {«,  -in  V  b)  entails  h  V  c. 

’It  is  assumed,  without  loss  of  generality,  that  the  set  of  variables  of  N '  and  of  G  are  disjoint  If  not,  the 
names  of  free  variables  of  N'  must  be  changed. 
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In  order  to  find  ramifications  that  are  consequences  of  the  goal  rather  than  the  database 
alone,  the  search  must  be  restricted  to  those  formulas  that  depend  upon  the  goal  for  support. 
Although  we  could  constrain  natural  deduction  to  make  only  those  deductions,  it  is  easier 
to  do  so  using  resolution.5 

4.4  Definitions  for  Resolution-Based  Forward  Reasoning 

The  Vbgc  and  Frps  procedures  use  binary  resolution  [77]  to  generate  ramifications.  In 
presenting  the  two  procedures,  the  definitions  and  terminology  of  this  section  will  be  useful. 

In  the  rest  of  this  chapter,  it  will  be  assumed  that  W  consists  of  a  finite  set  of  clauses, 
each  clause  being  implicitly  universally  quantified  over  each  of  its  variables.  Recall  that  a 
resolution  refutation 

Recall  that  aresolution  refutation  begins  by  creating  the  base  set ,  that  is,  WuCNF(-iG), 
where  CNF(/)  is  the  set  of  clauses  in  the  conjunctive  normal  form  of  /.  It  will  be  assumed 
that  unless  otherwise  specified,  all  set  of  support  deductions6  have  CNF(->G)  as  the  initial 
set  of  support.  Each  clause  C  in  the  set  of  support  is  the  denial  of  the  conjunction  S  = 
3yi  . . .  3yc-iC,  where  {yi, . . . ,  yc}  is  the  set  of  variables  in  C.  S  can  be  viewed  as  a  goal  to 
which  the  original  goal  G  has  been  reduced,  that  is,  if  we  can  find  a  solution  to  S  we  would 
also  have  a  solution  to  the  original  goal  G. 


Example  4.1  Suppose  u;i  =  -’Zebra(z)  V  Striped (zr)  6  W  and  G  =  Striped(.x ). 
CNF(-iG)  =  {  — > Striped(ar) }  and  via  resolution  of  this  clause  against  w i,  the 
clause  -iZebra(y)  would  be  derived.  ->Zebra(y)  is  the  denial  of  the  subgoal 
3yZebra(y)  to  which  we  have  reduced  the  original  goal  G. 


The  Extended  World  Model  W*  is  the  set  of  all  clauses  from  W,  all  clauses  with  all 
its  parents  in  W*,  and  no  other  clauses.  In  other  words,  W*  is  all  clauses  derived  strictly 
from  W.  and  as  such,  for  all  w  €  W*,  W  xv. 

The  Goal  Set  G*  includes  CN F(  — >G ),  any  clause  with  at  least  one  parent,  in  G".  and  no  i 

other  clauses.  G*  is  synonymous  with  the  set.  of  support  obtained  by  starting  with  CNF(-iG). 

Given  the  above  definitions,  every  deduction  step  on  base  set  W  U  CNF(-iG)  fits  into  J 

exactly  one  of  the  following  classes:  ' 

'Use  of  a  natural  deduction,  as  opposed  to  a  resolution-based  system,  does  not  pro  ludr  using  tin  goal  ] 

in  directing  the  search.  See,  for  example,  Boyer  and  Moore  or  Bledsoe  [I] 

'See  Section  2.1  for  definitions  of  set  of  support  ami  set  of  support  deduction.  , 
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1.  WG  Resolution  -  a  resolution  between  a  clause  of  G*  and  a  clause  of  W*  yielding 
a  new  clause  of  G*. 

2.  GG  Resolution  -  a  resolution  between  two  clauses  of  G*  yielding  a  new  clause  of  G*. 

3.  G  Factoring  -  factoring  a  clause  of  G*  yielding  a  new  clause  of  G*. 

4.  WW  Resolution  -  a  resolution  between  two  clauses  of  W*  yielding  a  new  clause  of 
W*. 

5.  W  Factoring  -  factoring  a  clause  of  W*  yielding  a  new  clause  of  W*. 

Standard  problem  solving  techniques  can  be  viewed  in  terms  of  these  resolution  types. 
Backwards  Reasoning  (Goal  Reduction)  consists  of  GG  and  WG  Resolution  and  G  Factoring 
Steps.  Forward  Reasoning  from  the  known  facts  of  a  problem  consists  of  WW  Resolution 
and  WW  Factoring  Steps.  Because  most  problems  are  such  that  goal  reduction  techniques 
search  a  smaller  part  of  the  space  than  forward  reasoning  from  the  known  facts,  we  often 
restrict  ourselves  to  backwards  reasoning  techniques  such  as  the  set  of  support  strategy  (of 
which  backwards  chaining  is  a  special  case).  While  generation  of  ramifications  is  a  form  of 
forward  reasoning  it  differs  from  the  common  usage  of  forward  reasoning  in  problem  solving 
in  a  crucial  way:  Rather  than  reason  forward  from  the  known  facts  for  an  entire  problem, 
ramifications  are  generated  by  reasoning  forward  from  goals  created  in  the  course  of  goal 
reduction. 

4.5  Resolution  on  Subgoal  Clauses  (Vrgc) 

Procedures  Vrgc  and  Prps  are  based  upon  binary  resolution  [77].  Traditionally,  resolution 
has  been  presented  as  a  method  for  refutation  of  a  set  of  clauses,  that  is,  a  proof  that  the  set 
of  clauses  is  unsatisfiablc.  Such  a  view  is  reinforced  by  resolution  being  refutation  complete, 
but  not  deductively  complete  —  resolution  can  deduce  false  from  any  unsatisfiable  set  of 
clauses,  but  cannot  deduce  all  sentences  (or  even  all  clauses)  entailed  by  a  set  of  clauses. 

In  practice,  however,  resolution  is  very  useful  as  a  deduction  rule  as  well.  As  was 
discussed  in  Chapter  2,  goal-directed  backwards  reasoning  can  be  performed  via  the  set 
of  support  restriction  on  resolution.  Further  ordering  of  the  allowed  resolution  steps  gives 
depth-first,  breadth-first  or  other  search  behavior.  The  main  reason  that  resolution  is  useful 
in  spite  of  its  lack  of  deductive  completeness  is  simple  —  the  .  entences  entailed  by  the  base 
set  but  not  deducible  using  resolution  are  generally  not  of  interest.  As  was  seen  in  Chapter  2 
and  as  will  be  seen  in  the  following  sections,  if  resolution  cannot  derive  a  proposition  P, 
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it  can  derive  a  set  of  clauses  which  together  are  at  least  as  useful  as  P.  In  backwards 
reasoning,  useful  will  mean  weaker ,  and  in  forward  reasoning  stronger. 

The  RGC  Procedure  Let  G  be  a  closed  formula  in  prenex-conjunctive  normal  form  with 
matrix  G  and  prefix  Ox\  . .  .Oxg.  Procedure  Vrgc  is  9-s  follows: 

1.  Replace  Variables  ij,. .  ,,xg  of  G  via  substitution  a  with  a  set  of  new  and  distinct 
constants  X\, . . . ,  Xg. 

2.  Perform  Set  of  Support  Resolution7  on  base  set  W  U  {Gtr}  using  the  conjuncts 
of  Go  as  the  initial  set  of  support. 

3.  Backsubstitute:  For  any  formula  N'  deduced,  return  N  =  Vy*  . .  .Vy^N'cr-1),  where 
{j/i  >  •  •  •  >  Vn)  is  the  set  of  free  variables  in  N'a-1 .8 

Let  us  illustrate  Vrgc  with  a  simple  example: 

Example  4.2  Let  G  =  3z3y(A(.T)  A  -iB (x,y)  A  C (y))  be  the  original  goal,  and 
let  W  =  {  -iA(z)  V  E(z,  v)  V  -^(z,  u),  ->D (q,r)  V  — >C(r )  V  H(r,  s),  l(t)}- 

1.  Replace  Variables:  Let  6  =  {x  <—  X,y  <—  V'}  yielding  GO  =  A(X)  A 
->B(vY,  Y)  A  c(y) 

2.  Perform  Set  of  Support  Resolution  on  base  set  Wll{A(X),  -iB(.Y,Y),  C(Y)} 
and  initial  set  of  support  (A(X),  ~>B(A',  Y),  C(Y )}.  The  resolution  steps 

in  the  deduction  tree  below  illustrate  the  derivation  of  the  clause  N'  — 

E(A',  Y)  V  H(Y,  m). 


->D(q.r)  V -.C(r)  v  H(r,  »)  C{  Y )  A(X) 

3.  Backsubstition  into  E(.Y,  Y)VH(Y,  m),  yields  the  ramification  N  =  VmE(r,|/)V 

H 

'See  Section  2.5  1  for  definition  of  resolution,  set  of  support,  and  deductions. 

6It  is  assumed,  without  loss  of  generality,  that  the  set  of  variables  of  N'  and  of  G  are  disjoint.  If  not,,  the 
names  of  free  variables  of  N'  must  be  changed. 
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As  was  the  case  with  V^at  replacement  of  the  variables  of  G’s  prefix  prevents  the  res¬ 
olution  steps  from  deriving  ramifications  that  are  ramifications  for  any  values  of  the  free 
variables  of  G.  For  example,  consider  a  goal  3zA(z)  and  a  world  model  W  containing  the 
clause  -iA(4)  V  D(4).  Without  substituting  some  new  ground  variable  for  the  x  in  A(z),  one 
might  think  that  D(4)  is  a  ramification  of  A(x).  Indeed  it  is  not,  since 

~.A(4)  V  0(4)  V  z  (A(z)  D  D(4)). 

4.5.1  Soundness  of  Vrgc 

The  soundness  of  Vrgc  is  an  immediate  consequence  of  the  soundness  of  the  resolution 
procedure. 

Theorem  4.1  (Soundness  of  Vrgc)  For  any  goal  G,  if  Vrgc  returns  N  then  N  is  a  rami¬ 
fication  of  G. 

Proof:  Vrgc  starts  with  base  set  W  U  Go.  Note  that  since  Go  is  a  set  of 
ground  clauses,  it  is  already  in  conjunct  normal  form.  By  the  soundness  of  the 
resolution  and  factoring  rules  (See,  for  example,  Theorem  5.1,  page  72  of  Chang 
and  Lee  [15]),  we  know  that  any  clause  N'  derived  via  the  resolution  procedure 
is  a  logical  consequence  of  the  base  set,  i.e., 

WuGct  [=  N\  (38) 

or  by  the  Deduction  Theorem, 

W  |=  Go  D  N'.  (39) 


Since  a  is  a  ground  substitution  to  arbitrary  constants  that  do  appear  in  W , 
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4.5.2  Completeness  of  Vrgc 

The  following  theorem  is  the  basic  completeness  result  for  Vrgc'- 
Theorem  4.2  (Completeness  of  Vrgc )9  Suppose  we  are  given 

1.  W,  a  satisfiable  set  of  clauses,  implicitly  universally  quantified 

2 .  G  =  a  set  of  ground  literals,  and 

3.  N  =  rii  V  . . .  V  rt„,  a  clause  with  variables  {j/i, . . . ,  ym) 

such  that  W  ft  (gj  A  ...  A  gp)  D  Vyi . . .  Vt/mN  and  such  that  W  ^  N,  then  there  exists 
a  set  of  support  deduction  of  clause  M  from  base  set  W  U  G  with  initial  set  of  support  G 
such  that  M  subsumes  N. 


Proof  Since  W  (=  (gx  A  . . .  A  gg )  D  . . .VymN,  then  for  new  and  distinct 
constants  >j, . . .  ,Ym  and  substitution  6  —  {yi  — *•  Y\, . . . ,  ym  — ►  Ym},  it  must  be 
thatWuG  |=  N0.  Thus,  the  set  WuGu{-mi0, . .  .->nm0}  is  unsatisfiable,  and 
by  Herbrand’s  Theorem,  there  exists  a  finite  set  C  of  ground  instances  of  the 
above  set  that  is  unsatisfiable.  Let  be  the  set  of  ground  instances  of  clause 
of  W  that  are  in  C,  and  similarly  define  Cg  C  G  and  C„  C  {~mi0, . .  . ->nm0}. 

By  hypothesis,  W  ^  N  and  therefore  W  ft  NO,  since  the  Yt  are  arbitrary.  Thus 
Cw  U  Cn  is  satisfiable,  and  so  there  must  exist  a  set  of  support  refutation  of  C 
from  initial  set  of  support  Cg.  By  Lemmas  2.12  and  2.13,  this  refutation  can  be 
converted  to  a  set  of  support  deduction  of  a  clause  M'  with  base  set  Cw  U  Cg, 
initial  set  of  support  Cs,  and  such  that  M'  C  N#.  We  now  use  the  Lifing  Lemma 
to  build  another  deduction  in  which  the  clauses  from  Gw  are  replaced  by  the 
corresponding  clauses  from  W.  This  builds  a  set  of  support  deduction  of  a 
clause  M  from  W  U  G  and  initial  set  of  support  G,  and  such  that  M p  -  M'  for 
some  p.  So,  we  have  Mp  C  N0,  but  since  0  is  invertible  (i.e.,  the  Y,  are  distinct), 

M</>  C  N,  where  <p  —  p0~^ .  Thus,  there  exists  a  set  of  support  deduction  of 
a  clause  M  from  base  set  W  U  G.  initial  set  of  support  G.  and  such  that  M 
subsumes  N.  I 

The  above  result  does  not  say  that  Vrgc  can  deduce  any  ramification  of  any  conjunct  ive 
goal.  Instead,  it  says  that  if  it  cannot  deduce  that  ramification,  it  will  deduce  one  that  is 
at  least  as  powerful.  Before  discussing  this,  a  word  on  clausal  form  is  in  order: 

3This  theorem,  though  found  independently,  is  a  variation  of  theorems  proven  by  Lee  [50]  and  Mini; 07/i 
and  (toiler  [70],  See  Section  4.?. 8. 
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An  arbitrary  formula  is  not  necessarily  equivalent  to  its  conjunct  normal-form.  The  lack 
of  equivalence  arises  due  to  Skolemization. 

Example  4.3  A(K)  is  a  conjunct  normal-form  for  the  formula  3xA(x),  where  K 
is  a  Skolem  constant.  Suppose  the  domain  is  the  set  of  natural  numbers  and  A(0) 
is  valid,  but  ->A (y)  for  all  y  >  0.  Then  A(K)  is  false  under  any  interpretation 
in  which  I(  is  assigned  a  non-zero  constant,  but  3xA(x)  can  still  be  true  under 
such  an  interpretation. 

Note  also  that  the  conjunct  normal- form  for  a  given  formula  is  not  unique  —  one  can 
choose  any  new  name  for  Skolem  functions. 

What  Theorem  4.2  says  is  that  for  some  way  of  converting  an  arbitrary  formula  N  to 
conjunct  normal-form,  Vrgc  can  derive  a  set  of  clauses  N1  such  that  each  clause  in  the 
conjunct  normal- form  of  N  is  subsumed  by  a  clause  of  N'. 

One  might  question  whether  finding  a  subsuming  clause  is  of  value.  After  all,  the  original 
goal  itself  is  “at  least  as  powerful”  as  the  ramification  in  the  sense  that  the  necessasry 
constraint  can  be  derived  from  it.  It  turns  out,  however,  that  subsumption  is  just  the 
desired  relationship  between  an  arbitrary  clause  N  and  a  clause  N1  derived  by  Vrgc •  The 
subsumed  clause  N  can  differ  from  N'  only  in  (1)  having  more  disjuncts  (literals;,  and 
(2)  having  some  variables  in  N'  replaced  by  constrants  in  N.  It  immediately  follows  that, 
the  subsumed  clause  A  is  a  ramification  if  N'  is  a  ramification.  It  is  preferable  to  know 
that  A  is  a  ramification  of  G  rather  than  that  A  V  B  is  a  ramification.  Similarly,  it  is 
preferable  to  know  that  VyA(y)  is  a  ramification  of  G  rather  than  to  know  only  that  A(4) 
is  a  ramification  of  G.  Stated  differently  and  only  slightly  inaccurately,  Vrgc  does  can  find 
arbitrary  ramifications;  it  simply  eliminates  unnecessary  variable  bindings  and  unnecessary 
disjuncts. 

4.5.3  Caching  the  Results  of  Vrgc 

In  performing  goal  reduction  on  some  goal  G,  it  is  likely  that  one  will  encounter  many 
similar  subgoals.  In  terms  of  resolution,  many  clauses  in  the  set  of  support  will  have  sets 
of  literals  in  common.10  As  a  result  some  of  the  ramifications  of  one  subgoal  clause  will 
often  be  the  same11  as  the  ramifications  of  other  clauses;  it  would  be  wasteful  to  generate 
these  ramifications  from  scratch  for  each  similar  subgoal  clause.  In  addition,  it  is  usually 
impossible  to  compute  all  the  ramifications  of  a  subgoal.  For  a  given  subgoal  clause  it  is 

10 Modulo  variable  names 
11  Again  modulo  variable  names 
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desirable  to  store  the  results  to  date  in  order  to  begin  the  search  anew  sometime  later.  As 
will  be  seen  in  Section  4.7,  a  third  reason  to  cache  ramifications  is  that  subgoals  of  a  goal 
usually  inherit  all  the  ramifications  of  that  goal. 

Fortunately,  there  is  a  straightforward  way  to  cache  such  results,  namely,  in  W  itself. 
By  definition,  if  clause  N  is  a  ramification  of  conjunction  G  =  Gi  A  . . .  A  Gg,  then 

W  (=  -.Gi  V  ...  V  -iGff  V  N, 

and  so  ->Gi  V  ...  V  ->Gg  V  N  can  be  added  to  W. 

If  the  ramification  follows  from  a  proper  subset  G'  of  the  conjuncts  of  G  it  is  preferable 
to  note  this  fact  in  W  via  a  clause  ->G'  V  N  instead  of  -iG  V  N. 
et  us  refer  to  G'  as  a  foundation  of  N,  that  is, 

Definition  4.1  For  a  conjunctive  goal  G  and  a  clause  N,  where  N  is  a  ramification  of  G,  a 
a  foundation  of  N  is  any  subset  G'  of  the  conjunction  G  such  that  W  (=  — <G7  V  N. 

One  would  like  to  require  a  foundation  to  be  minimal,  that  is,  that  no  subset  of  a  foundation 
is  also  a  foundation  of  that  ramification.  Unfortunately,  the  problem  of  determining  whether 
a  given  foundation  is  minimal  is  semi-decidable,  and  minimality  will  not  be  required  in  the 
discussion  that  follows. 

A  given  deduction  of  a  ramification  N  from  a  goal  G  may  not  involve  all  of  the  conjuncts 
of  G.  While  minimality  is  too  strong  a  condition  to  require,  it  is  easy  to  note  only  the 
conjuncts  of  G  that  were  actually  needed  in  the  given  deduction  of  G.  To  do  so,  Vrgc  can 
be  modified  to  record  such  a  foundation  of  each  ramification  deduced.  Let  us  notate  the 
foundation  of  a  clause  i  as  Fj.  The  modified  procedure  is: 

1.  Replace  Variables  a:*, . . . , x9  of  G  via  substitution  o  with  a  set  of  new  and  distinct 
constants  A"i , . . . ,  Xy. 


> 


2.  Record  Initial  Foundations  of  clauses  from  W  and  literals  of  Go.  For  every 
clause  w  of  W,  Fw  ~  {}.  For  every  literal  g  in  Go,  Fg  =  g. 

3.  Perform  Set  of  Support  Resolution  on  base  set  W  U  {Go}  using  the  con  juncts  of 
Go  as  the  initial  set  of  support.  For  each  clause  v  deduced,  if  n  has  only  one  parent  /-. 
then  Irn  =  Fp.  If  n  has  parents  p  and  q.  then  Fn  —  Fp  U  Fr/. 

4.  Backsubstitute:  For  any  formula  N'  deduced,  return  N  =  Vt/i  . . .  V  i/M (  N'o-1 ),  where 
{ 2/i , . . . ,  ijn}  is  the  set  of  free  variables  in  N'o-1 .  In  addition,  the  clause  ((V  -i  /yv  )VN  )<p 
may  be  added  to  W,  where  <p  is  a  uniform  renaming  of  the  variables  in  ('/  -i /y\><)  V  N. 
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Example  4.4  Suppose  we  are  given 

G  =  {A(ff),B(»,z)} 

Wi  =  -iA(m)  V  C(m,n) 
w2  =  -iC(o,p)  V  D(o) 
wlfW2  €  W. 

The  substitution  {y  —>  Y,z  ->  Z}  is  used  to  replace  the  variables  of  G.  When 
Vrgc  generates  N\  =  C (Y,n),  F^  is  {A(T)}  so  -<A(q)W  C(q,r)  can  be  added  to 
W,  where  q  and  r  are  new  variables.  Similarly,  upon  generation  of  N2  =  D(T ), 
-iA(s)  V  D(s)  can  be  added  to  W. 


4.6  Resolution  with  Partial  Subsumption  ( Vrps ) 

The  Vrgc  Procedure  of  Section  4.5  takes  a  conjunction  of  literals  G  =  g,  A  . . .  A  g9  and 
reasons  forward  from  it  in  order  to  generate  ramifications.  Unfortunately,  G  does  not  appear 
in  the  normal  course  of  resolution  (or  resolution-based  residue  techniques);  its  negation,12 
the  clause  C  =  -iG  appears  in  the  set  of  support.  In  order  to  use  Vrgc  a  new  base  set 
BrgC  =  W  U  ->Ccr  must  be  constructed  (where  a  is  a  substitution  replacing  variables  of  G 
with  new  and  distinct  constants)  on  which  to  perform  resolution. 

By  a  slight  modification  of  Vrgc  backward  reasoning  (goal  reduction)  and  forward 
reasoning  (generation  of  ramifications)  can  be  performed  without  re-negation  of  goal  clauses 
and  using  the  same  resolution  inference  engine.  The  modified  technique  will  be  called 
“Resolution  with  Partial  Subsumption”13  or  Vrps- 

Vrps  is  a  severely  restriction  on  set  of  WW-Resolution  and  W- Factoring  steps,  gener¬ 
ating  the  same  set  of  ramifications  ramifications  as  were  generated  by  Vrgc ■  Instead  of 
directly  generating  the  ramifications  as  does  Vrgc->  Vrps  generates  clauses  of  the  form 

P  =  ci  V  ...Vcm  Vnx  V  . . .  V  n„, 

where  Pc  =  Cj  V  . . .  V  cm  subsumes  the  negation  of  a  G  and  P/v  =  nj  V  ...  V  nm  is  a 
ramification  of  G.  As  stated  above,  if  resolution  is  being  used  for  goal  reduction  (as  in 
Resolution  Residue),  the  conjunct  G  will  not  explicitly  appear,  but  rather  its  negation. 

12More  accurately,  the  disjunction  of  the  complement  of  each  of  its  conjnncts. 

,3The  name  “Partial  Subsumption”  has  been  used  to  show  the  similarity  to  Chakravarthy's  use  of  the 
same  term  (Chakravarthy  [12,14,13]). 


4.6.  RESOLUTION  WITH  PARTIAL  SUBSUMPTION  (VRPS) 


85 


Example  4.5  Suppose  W  contains  clauses  -iAvB  and  ->BvC.  Given  G  =  AaZ, 

Vrgc  it  negates  the  clause  -iA  V  ->Z  that  actually  appeared  and  derives  the 
ramification  C.  In  contrast,  Vrps  simply  resolves  the  two  clauses  of  W*  together 
producing  a  new  clause  -«A  V  C  stating  that  C  is  a  ramification  of  any  subgoal 
containing  the  conjunct  A. 

The  above  scheme  has  the  advantage  that  the  same  database  can  be  used  for  both 
forward  and  backward  reasoning.  In  addition,  if  the  results  of  forward  reasoning  happen  to 
apply  to  more  than  one  subgoal,  they  will  already  be  present  without  any  explicit  caching 
mechanism.  Let  us  now  explore  Vrps  iu  more  detail. 

4.6.1  The  Vrps  Procedure 

Suppose  that  W  is  a  satisfiable  set  of  clauses,  and  that  there  exists  a  goal  G  =  Gj  A . . .  A  Gm , 
where  the  G^  are  literals.  In  other  words,  -iG  £  G*.  Let  a  be  a  substitution  replacing  each 
variable  of  G  with  a  new  and  distinct  constant. 

The  Vrps  procedure  is  stated  as  the  following  restriction  on  resolution  as  follows: 

For  Cg  €  G*,  Vrps  starts  with  base  set  W  and  allows  any  resolution  (or 
factoring)  steps  such  that  the  resolvent  (factor)  contains  a  literal  L  that  unifies 
with  a  literal  of  Cgo. 

Note  that  it  is  impossible  that  a  resolvent  (or  factor)  C  have  a  literal  that  unifies  with 
a  literal  of  Cg  unless  the  same  is  true  for  at  least  one  parent  of  C . 

Example  4.6  Given  goal  G  =  A(x)AB(x)AC(x),  a  —  {x  — ►  A'}  and  w\ , . . . .  te5  6 
W,  where 

=  -iA(z)VD(z) 

V’2  =  iD(y)  V  E(v) 

r«3  =  ->D(4)vF(4) 

=  -iA(tt)  V  ->B(u)  V  H ( if. ) 

U>5  =  -4(u)  V  A(u), 

Vrps  can  resolve  W\  and  w2  and  add  the  resolvent  ay  =  — > A(  / )  V  E ( / ) . 
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Vrps  can  not  resolve  w\  and  W3  because  the  resolvent  -iA(4)  V  F(4)  has  no 
literals  that  unify  with  a  literal  in  -iGct. 

Vrps  can  not  resolve  w\  and  uj5  because  the  resolvent  ->l(s)  V  D(s)  has  no  literals 
that  unify  with  a  literal  in  -iG a . 

Vrps  can  resolve  W4  and  w 5  —  the  resolvent  — <l(r )  V  -iB(r)  V  H(r)  has  a  literal 
that  unities  with  a  literal  in  ->G<r. 

It  is  tempting  to  try  to  use  a  stronger  restriction,  namely  that  if  a  parent  has  a  literal  that 
unifies  with  a  literal  in  Cq,  then  the  resolvent  (factor)  must  also  have  a  literal  that  unifies 
with  that  literal  of  Cg ■  It  turns  out  that  this  is  too  strong  a  restriction  for  completeness. 

4.6.2  Soundness  of  Vrps 

Vrps  is  sound  in  that  each  clause  deduced  is  the  proof  of  a  given  ramification. 

Theorem  4.3  (Soundness  of  Vrps )  Suppose 

1.  W  is  a  satisfiable  set  of  clauses, 

2.  G  '=i(/iV...V  -<gg  €  G* 

3.  C  =  Ci  V  . . .  V  cq  is  a  clause  deduced  by  Vrps- 

Then  there  exists  a  substitution  9  and  some  non-empty  C  C  C  such  that  C'6  C  G'  and 

W  (=  (/\(iC')D  \J{C-C'). 

Proof:  The  proof  is  quite  straightforward.  Let  o  be  the  substitution  replacing 
the  variables  of  G  by  new  and  distinct  constants.  Since  Vrps  performs  resolution 
on  base  set  W,  any  clause  C  generated  by  Vrps  is  such  that  W  C.  So,  for 
any  C'  C  C, 

w  b  (A(^c")  ^  W-  -  c")- 

Furthermore,  since  every  clause  C  generated  bv  Vrps  contains  at  least  one 
literal  that  unifies  with  a  literal  from  G'cr  it.  is  guaranteed  that  there  is  some 
substitution  9  such  that  at  least  one  of  the  literals  in  C9  is  a  literal  of  G'.  Thus, 

C  need  not  be  emptv.  I 
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4.6.3  Completeness  of  Vrps 

The  basic  completeness  result  for  Vrps  is  similar  to  that  for  Vrgc  —  not  every  ramifica¬ 
tion  N  can  be  deduced,  but  ramifications  that  subsume  every  clause  in  a  conjunct  normal 
form  of  N  are  deducible  by  Vrps •  In  addition,  the  deduced  clauses  tell  which  conjuncts 
of  G  were  needed  for  the  deduction. 

The  completeness  result  for  Vrps  is  Theorem  4.7,  but  prior  to  proving  it,  some  prelim¬ 
inary  results  are  needed. 

Lemma  4.4  (Bubble  Lemma)  Suppose  that  D'  is  a  binary  resolvent  nf  clauses  C[  and  C'7. 
Suppose  further  that  C\  =  C[4>\1  UM  and  Ci  =  C'^fa1  UN  for  some  substitution  fa  and  fa, 
and  sets  of  literals  M  and  N .  Then  C\  and  Ci  (or  factors  of  C\  and/or  Ci)  have  a  binary 
resolvent  D  such  that  for  some  9,  D1  C  D6,  and  DO  —  D1  C  (M  U  N)9. 

Proof  By  the  Lifting  Lemma  (Lemma  2.4),  the  clauses  C and  Cifa1  (or 
facotrs  of  these  clauses)  have  a  resolvent  Q  such  that  D1  is  an  instance  of  Q ,  that 
is,  for  some  substitution  6 ,  QO  —  D’.  The  addition  of  literals  M  to  clauses  C\fal 
and  N  to  Cifax  does  not  change  the  fact  that  0  is  still  an  mgu  for  literals 
in  Cxfa1  and  Cifa1,  and  thus  clauses  C\  and  Ci  (or  factors  of  these  clauses) 
must  also  have  a  resolvent  D  via  mgu  9.  All  literals  present  in  Q  will  also  be 
present  in  D ,  so  D'  C  DO.  The  additional  literals  of  D  will  be  from  ( M  U  N)9. 
so  DO  -  D1  C  ( M  U  N)9.  Note  that  some  of  the  literals  of  (M  U  N)9  might  be 
identical  to  literals  of  Q,  and  thus  it  is  not  correct  to  say  (D  —  (M  U  N))9  =  D'. 

I 

Lemma  4.5  (Bubble  Lemma  (Factoring))  Suppose  that  D1  is  a  factor  of  clause  C’. 
Suppose  further  that  clause  C  =  Cp~x  U  M  for  some  substitution  p.  and  a  set  of  literals  M . 
Then  C  has  a  factor  1 D  such  that  for  some  9,  D'  C  DO,  and  DO  —  D'  C  MO. 

Proof  By  hypothesis,  there  exists  some  substitution  p ,  a.  most  general  unifier 
of  two  or  more  literals  of  C'.  Since  ( C  -  M)<p  =  C' ,  it  must  be  the  case  that 
the  substitution  ipp  unifies  two  or  more  literals  of  C,  and  thus  D'  C  D^pp  and 
Dipp  —  D'  C  Mpp.  Even  if  < pp  is  not  an  mgu  of  the  subset  of  literals  from  C. 
the  existence  of  pp  implies  the  existence  of  an  mgu  9  with  the  above  properties. 


Theorem  4.6  (Bubble  Theorem)  Let  M  be  a  satisfiable  set  of  clauses  and  U  be  a  sat- 
isfiable  set  of  unit  chaises  such  that  there  exists  a  set  of  support  deduction  D  of  a  ctniisi  .V' 
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from  base  set  MUU  and  initial  set  of  support  U.  Let  S  be  the  smallest  subset  o/M  contain¬ 
ing  all  clauses  with  a  literal  L  such  that  -> L  unifies  with  some  u  £  U.  Then  there  exists  a 
set  of  support  deduction  of  a  clause  C  =  N  U  P  from  base  set  M  and  initial  set  of  support  S, 
where 

1.  N'  is  an  instance  of  N ,  and 

2.  P  subsumes  the  clause  P'  =  \f  ->u  for  all  u  £  U. 

Proof:  Since  U  is  satisfiable,  no  two  clauses  of  U  can  resolve  against  each  other. 
Therefore  every  clause  from  U  in  the  fringe  of  D  is  resolved  against  a  clause  of  M. 
Consider  anew  deduction  tree  D'  with  a  fringe  Fringe(D')  =  Fringe(Z?)  —  U.  For 
every  subtree  of  D ,  there  will  be  a  corresponding  deduction,  i.e.,  subtree  of  D' 
for  which  the  theorem  holds.  The  proof  is  by  induction  on  the  height  n  of  the 
subtrees  of  deduction  tree  D' . 

Base  Case  ( n  =  1):  Since  D  is  a  set  of  support  deduction  using  nodes  from 
U  as  the  initial  set  of  support,  every  clause  m  in  the  fringe  of  D'  is  a  clause  that 
resolved  against  a  clause  of  U  in  D.  Thus  the  clause  m  contains  a  literals  whose 
negation  unifies  with  some  u  €  U.  In  addition,  m  =  n\Jp,  where  p  is  a  singleton 
set  that  subsumes  P'  (in  other  words,  p  contains  the  literal  that  resolved  against 
the  unit  clause  of  U),  and  nf)  —  r,  where  r  is  the  resolvent  of  m  and  a  clause 
from  U  in  D. 

Induction  Step  (n  =  Assume  that  the  theorem  holds  for  all  subtrees 
of  height  k  —  1  or  less.  Each  subtree  of  height  k  was  created  either  via  a  binary 
resolution  or  via  a  factoring  step.  If  the  step  was  a  binary  resolution  step,  by 
Lemma  4.4,  there  exists  a  resolvent  C  =  N  U  P  such  that  (1)  the  corresponding 
node  in  D  is  an  instance  of  N  and  (2)  there  exists  a  substitution  o  such  that 
Po  C  P' .  Since  by  hypothesis  at  least  one  of  the  parent  clauses  of  C  was  in 
the  set  of  support,  then  so  is  C .  Simlarly,  if  the  step  was  a  factoring  step,  then 
by  Lemma  4.5  and  a  similar  argument,  the  factor  is  in  the  set  of  suppport,  the 
corresponding  node  of  D  is  an  instance  of  some  of  the  literals  of  the  clause,  and 
the  remaining  literals  subsume  P' .  I 

The  major  result  of  this  section  can  now  be  stated  and  proven: 

Theorem  4.7  (Completeness  oiVpps)  Suppose 


1.  W  is  a  satisfiable  set  of  clauses, 

2.  G  =  gj  A  . . . ,  Ag3  is  a  conjunction  of  literals,  such  that  W  U  {G}  is  satisfiable. 
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3.  N  =  nx  V . . .  V  n„  is  a  clause  and  a  ramification  of  G  given  W,  and  such  that  W  N. 

There  exists  a  set  of  support  deduction  D  of  a  clause  M  =  Mg  U  Ms  from  base  set  W  and 
initial  set  of  support  S,  where 

1.  Mg  subsumes  -iG, 

2.  Ms  subsumes  N,  and 

3.  S  contains  exactly  those  clauses  of  W  having  a  literal  l  such  that  ->l  unifies  with 
some  g,-. 

4-  Every  non-fringe  clause  in  D  contains  at  least  one  literal  l  such  that  -> l  unifies  with 
some  g^. 


Proof: 

Let  (Tg  =  {xi  — ►  Xi,...,xg  — *•  X3),  where  {.Tj , . . . , xg)  is  the  set  of  variables 
in  G,  and  {X\, . .  .,Xg}  is  a  set  of  new  and  distinct  constants. 

Let  <rn  =  {j/i  — >  Y\,...,yn  —  Yn } ,  where  {yi,...,y„}  is  the  set  of  variables 
in  N<t3,  and  {Yi, . . .  ,  Yn}  is  a  set  of  new  and  distinct  constants. 

Let  G  =  {g^, . . .  ,g gOg]  and  let  N'  =  {-.n iaacrn, . . . ,  -mn<rs(7n}. 

Since  N  is  a  ramification  of  G,  and  because  the  Xx  and  Yx  are  distinct  and 
arbitrary, 

W  t=  (gi<7g  A  ...Ag3CTs)  D  N<7sct„, 

and  therefore  the  set  W  u  G  U  N'  is  unsatisfiable.  By  Herbrand’s  Theorem, 
there  must  exist  a  finite  set  H  of  ground  instances  of  W  U  G  U  N'  that  is  also 
unsatisfiable.  Since  W  U  N'  and  W  U  G  are  both  satisfiable,  H  must  contain 
clauses  both  from  G  and  N'.  Let  the  set  of  clauses  in  H  from  W,  G  and  N'  be 
notated  as  W,  G  and  N',  respectively. 

Constuction  0:  Let  Wq  be  the  subset  of  W  constructed  by  removing  from  W 
every  clause  g  U  to,  where  g  EG,  and  let  Ho  =  W0  UGuN',  Since  Ho  contains 
all  the  unit  ground  clauses  g  E  G,  Ho  {=  H,  and  Ho  is  also  unsatisfiable.  Since 
WtlN'  is  satisfiable,  so  is  WoU  N\  and  so  by  Theorem  4.2,  there  must  then  be 
a  set  of  support  deduction  Do  of  a  clause  No  from  base  set  W0  U  G  and  initial 
set  of  support  G  such  that  N0  subsumes  (is  a  subset  of)  N ogon. 

Constuction  1:  Let  W qg  be  the  subset  of  W0  containing  all  clauses  that  con¬ 
tain  a  literal  l  such  that  -d  E  G.  Based  on  deduction  D0  and  Theorem  4.0, 
there  also  exists  a  set  of  support  deduction  D\  of  a  clause  Cj  =  P]  U  Ni  from 
base  set  W0  and  initial  set  of  support  W og>  where  Nj  subsumes  No  (and  there¬ 
fore  subsumes  N <JgcTn)  and  Pj  subsumes  Vi^gi-  f°r  all  g,  E  G.  Furthermore, 
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since  Wo  contains  no  clauses  with  a  literal  l  6  G,  every  non-fringe  clause  of  Dy 
is  guaranteed  to  contain  a  literal  l  such  that  ->/  6  G. 

Lifting:  The  Lifting  Lemma  can  be  used  to  turn  Deduction  D\  into  the  desired 
Deduction  D.  Each  node  in  the  fringe  of  Deduction  D\  can  be  replaced  by  the 
corresponding  clause  in  W.  By  induction  on  subtrees,  the  Lifting  Lemma  can 
be  shown  to  lift  these  variables  to  the  root,  deriving  a  clause  M .  Since  each 
clause  of  D\  contains  a  literal  /  such  that  ->/  £  G,  each  non-fringe  clause  of  D 
contains  a  literal  whose  complement  unifies  with  a  clause  of  G.  Furthermore, 
since  the  X{  do  not  appear  in  W,  each  non-fringe  clause  of  D  contains  a  literal  l 
whose  complement  unifies  with  a  conjunct  of  G. 

By  Theorem  4.6  it  is  also  the  case  that  Nx  is  an  instance  of  Mpj,  that  is, 
there  exists  a  6  such  that 


MnO  =  Ni  C  Ncr0an. 

But  since  (<rgan)  is  invertible, 

MN6(agan)~ 1  C  N, 
in  other  words,  M/v  subsumes  N.  I 

4.7  Inheritance  of  Ramifications 

So  far,  deduction  of  ramifications  of  a  single  goal  has  been  considered.  In  practice,  however, 
finding  ramifications  and  goal  reduction  via  backwards  reasoning  are  interleaved.  Instead 
of  just  wanting  to  know  ramifications  of  a  single  goal,  it  would  also  be  useful  to  know 
whether  ramifications  of  a  goal  Gi  are  still  valid  for  goals  further  down  in  the  backwards- 
reasoning  deduction  tree.  If  so,  any  work  done  in  finding  ramifications  for  a  goal  Gi  need 
not  be  repeated  to  find  the  same  ramification  for  goal  G2.  Furthermore,  to  find  additional 
ramifications  for  G2,  forward  reasoning  could  start  from  the  inherited  ramification  rather 
than  from  scratch. 

Although  it  seems  reasonable  that  ramifications  can  be  inherited,  it  is  not  always  the 
case  that  a  ramification  of  one  goal  is  a  ramification  of  its  offspring.  One  reason  is  the 
renaming  and  binding  of  variables,  but  that  is  easily  taken  care  of.  The  other  reason  is 
due  to  somewhat  pathological  cases  involving  the  merging  of  two  goals  (GG- Resolutions, 
in  the  terminology  of  Section  4.4).  In  this  section,  a  precise  formulation  is  given  for  what 
ramifications  may  be  gleaned  from  ramifications  of  an  ancestor  goal. 
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Usually,  ramifications  are  inherited  from  parent  nodes  (modulo  variable  substitutions). 
It  is  easy  to  see  why  this  is  so:  Suppose  a  goal  Gi  gives  rise  to  a  new  goal  G2  via  backward 
reasoning.  Suppose  also  that  Gi  has  ramification  N.  Since  G2  was  derived  from  Gi  via 
backwards  reasoning,  then  Gi  could  be  derived  from  G?  by  forward  reasoning.  But,  since  N 
was  derived  from  Gi  via  forward  reasoning  as  well,  . 


r 

— ►  Oi 


FR 


N, 


that  is,  N  should  be  derivable  from  both  Gj  and  G2  by  forward  reasoning. 


Example  4.7  Consider  a  goal  Gi  =  C  A  D.  If  C  D  N,  then  N  is  a  ramification 
on  Gj .  Suppose  now  that  A  A  B  D  C  is  applied  to  reduce  Gi  to  G2  =  A  A  B  A  D. 
Clearly,  since  A  A  B  D  C  and  C  D  N,  then  A  A  B  D  N,  and  so  the  ramification  N 
is  inherited  by  G2. 

Let  us  now  consider  inheritance  of  ramifications  for  clauses  in  G*  for  various  possible 
steps  of  a  resolution  refutation  (See  Section  4.4  for  explanation  of  terms  G*,  W*,  WG 
Resolution,  GG  Resolution,  etc.).  Consider  a  clause  Cq  €  G*,  that  is,  clauses  from  the 
negation  of  the  original  goal  G  has  been  reduced  via  0  or  more  resolution  and  factoring  steps 
to  a  clause  Cc-  Let  Gj(y)  =  -> C’g ■  Suppose  also  that  Gi  has  some  ramification  fil.  If  Gj  is 
further  reduced  to  some  new  goal  G2,  will  fil  also  be  a  ramification  (modulo  variable  names) 
of  G2?  The  next  three  subsections  address  that  issue  with  a  series  of  theorems  which  follow 
easily  from  the  definitions  of  resolution  and  ramification. 

4.7.1  Inheritance  under  WG-Resolution  Steps 

The  following  theorem  holds  for  WG  Resolution  steps: 

Theorem  4.8  Suppose  Cq  is  a  clause  from  G*,  C\v  is  a  clause  from  W‘,  and  Cr  is  a 
resolvent  of  Cc  andC'w  Vla  unifier  a.  //fil  is  a  ramification  of  ~<Cg  then  filer  is  a  ramification 
of  Cr. 

Proof:  Let  us  represent  the  various  clauses  as  follows: 

CG  =  -<£<71  V...V->LGn  (40) 

Cw  =  ~*Lw\  V  ...  V  -iL\ym  (41) 

Cro  -  ->Lg2°  v  ...  V  ->L(7, ,a  V  -> L\y2a  V  ...  V  -> L\vma,  (42) 
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where  Lij  represents  a  positive  or  negative  literal.  Note  that  in  representing  Cr  as  above, 
we  have  assumed  (without  loss  of  generality)  that 

-iLoicr  =  Lwio. 

Since  N  is  a  ramification  of  Cq, 

W(=(ZG1  A...LGn)D  N.  (43) 

So,  the  following  instance  of  (43)  holds: 

W  1=  (Lai<r  A...  WO  3  No],  (44) 

Rewriting  (41)  gives 

IJW2  A  ...  A  Lwm  D  ->Lvvi,  (45) 

and  therefore 

W  (Lw2cr  A  ...  A  Lwma  A  LgiO  A  ...  A  Lcn&)  D  ->Lw\o.  (46) 

But,  b(7i ct  and  Lw\0  are  identical  literals,  so 

W  |=  (Lw2&  A  . . .  A  Lw-ma  A  Lg2&  A  . . .  A  LguCt)  D  jf-Gio.  (47) 

Combining  (47)  and  (44)  gives  the  desired  result: 

W  (=  (Z<w2Cr  A  . . .  A  Lw-mV  A  Lg2°  A  . . .  A  L(jno)  Zj  No,  (48) 

or  equivalently, 

W  |=  -i Cr  D  No.  (49) 

I 


4.7.2  Inheritance  under  G  Factoring  Steps 

Theorem  4.9  Suppose  C'g  is  a.  clause  from  G*,  and  Cr  is  a  factor  of  Cg  via  unifier  cr.  If 
N  is  a  ramification  of  ~>Cg  then  No  is  a  ramification  of  ~>Cr. 


Proof:  Let  us  represent  the  various  clauses  as  follows: 
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u>i  =  -'A  V  -<B  V  G  Gi  ~  G  —  AV  ~>C  V  G 


G<  =  ~'B  V  —<C  (*♦) 


(*)  -  N  is  ramification  of  G2. 

(**)  -  N  is  not  a  ramification  of  G4. 

Figure  10:  Non-Inheritance  of  Ramifications 


Cr<7  =  -iLgiO  V  .  . .  V  ~<LGn<7,  (51) 

where  Lij  represents  a  positive  or  negative  literal.  Note  that  in  representing  Cr  as  above, 
it  has  been  assumed  (without  loss  of  generality)  that  the  factoring  unifies  only  the  first  two 
literals,  that  is, 

Lc,\0  =  Leva-  (52) 

Since  N  is  a  ramification  of  Cg 


W\=(LGiA...LGn)DN.  (53) 

In  addit:on  the  instance  of  (53)  obtained  by  making  substitution  a  holds,  so: 

W  )=  {Lq\o  A  . . .  LcnV)  D  Ncr.  (54) 

But,  only  one  of  the  identical  literals  appears  in  (52),  that  is, 

wh  (Lena  A  . .  £<3„ct)  D  N<t].  (55) 

So,  from  (55)  the  desired  result  follows,  that  is, 

W  (=  -iCr  D  No.  (56) 

I 

4.7.3  Inheritance  under  GG  Resolution  Steps 

Ramifications  arc  not  necessarily  inherited  under  C.'C-Rcsoliif  ion.  Consider  (lie  following 
example: 
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Example  4.8  Figure  10  illustrates  a  goal  that  does  not  inherit  the  ramifications 
of  its  parents.  Oonisder  a  goal  G  and  rules  A  A  B  D  G,  ->4  A  C  D  G,  and  .4  D  /V. 
In  clausal  and  non-clausal  form,  these  are  as  follows: 

Clausal  Form  Non-Clausal 


Gr 

m 

ti>2 

W3 


->G 

-.4  V  ->B  V  G 
A  V  -<C  V  G 
-i4  V  N 


G 

A  A  B  D  G 
^A  A  C  D  G 
ad  N 


WG- Resolution  (Backchaining)  gives  two  new  goals 


G2  ~>4  V  -1 B 

G3  4  V  ->C 


A  AS 
->4  A  C 


Performing  GG-Resolution  011  G2  and  G3  gives  a  new  goal 
G4  -’ll  V  ~>C  B  A  C 

Note  that  G2  has  N  as  a  ramification  (via  w,  ).  Goal  G2  gives  rise  to  G4  via  GG- 
Resolution,  but  G4  does  not  inherit  N  as  a  ramification  (although  some  other 
line  of  reasoning  may  establish  N  as  a  ramification  of  G4). 


Theorem  4.10  Suppose  C ,\  and  C'b  are  clauses  from  G*,  Cr  is  a  resolvent  of  C a  and  C'b 
via  unifier  0 ,  and  N  is  a  ramification  of  ~>C’a-  Suppose  further  that  the  literal  on  which  C a 
and  Cb  are  resolved  is  not  a  member  of  all  foundations  of  N,  that  is, 


W  \=  (La\  A  . . .  !\  LA(i-\)  LA(i+\)  A  . .  .Lau)  D  N 

for  C,  1  =  L,  1,  A  ...  A  I, An.  Then  No  is  a  ramification  of  -1 Cr . 

Proof:  Let  us  represent  the  various  clauses  as  follows: 

C’a  =  -'Lai  V  ...  V  ~’L.4n 
C'b  =  -’Lai  V  . . .  v  -> Lam 

C rCT  =  —1 T  \'iO  V  ...  V  ->La„(T  V  -iLp 2<7  V  ...  V  ~'LBm^r 


(57) 


(58) 

(59) 

(60) 


where  LtJ  represents  a  positive  or  negative  literal.  Note  that  in  representing  Cr  as  above, 
we  have  assumed  (without  loss  of  generality)  that 

-'La \a  =  Lr\o. 


u 
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The  result  follows  immediately  from  (57).  We  know  that  the  instance  of  (57)  obtained 
by  substitution  a  must  hold,  that  is, 

W  h  (WA . LAn)aDtia.  (61) 

Note  that  (61)  is  derivable  fiom  (57)  assuming  that  i  =  1,  that  is,  the  first  literal  is  the  one 
resolved  away.  Since  ->Cr  contains  a  superset  of  the  literals  needed  in  (61)  to  prove  N<r,  it 
must  also  be  sufficient  to  prove  N<r.  Thus  we  have, 

W  (X.42cr  A  . . .  A  LAnj  A  Lb7C  A  . . .  A  Lf)mo)  D  No,  (62) 

or 

W  (=  ~*C  ro  D  No.  (63) 


4.8  Related  Work 

4.8.1  McSkimin  and  Minker 

McSkimmin  and  Minker  [69]  was  an  early  effort  at  using  semantic  information  to  improve 
efficiency  of  database  queries.  Via  information  stored  in  a  semantic  network14,  McSkimmin 
and  Minker  (1)  allowed  unification  of  variables  only  with  variables  from  the  same  domain 
( semantic  unification),  (2)  checked  to  see  if  a  query  is  inconsistent  with  a  series  of  allowed 
forms  ( semantic  well-formedness),  and  (3)  based  on  information  on  the  number  of  possible 
answers  to  a  query,  checked  to  see  whether  all  the  answers  to  a  query  had  been  found  and 
therefore  no  more  search  is  needed  ( semantic  actions). 

4.8.2  Stallman  and  Sussman 

Stallman  and  Sussman ’s  EL  [90]  was  among  the  first  to  explicitly  use  forward  reasoning  in  a 
declarative  form  to  restrict  a  search. 10  EL’s  goals  consisted  of  a  set  of  variables  to  be  bound 
in  such  a  way  as  to  be  consistent  with  a  model  of  various  electrical  circuit  components.  The 
values  of  the  variables  were  currents,  voltages,  and  states  of  transistors.  The  system  had  no 
backwards  reasoning  component  or  database  lookup  component,  but  rather  had  procedures 

uTlie  semantic  network  used  by  McSkimmin  and  Minker  was  very  close  to  prediacte  logic,  however. 
'■'Earlier,  David  Waltz  [102]  line-labeling  program  had  propagated  constraints  via  special-purpose 
procedures. 
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for  guessing  values  of  variables.  After  each  guess,  it  reasoned  forward  to  find  ramifications 
of  the  existing  bindings  of  its  variables.  Rather  than  allow  random  forward  inference,  EL’s 
inference  was  limited  to  one-step  deductions,  that  is,  instantiation  of  Horn  clauses  of  the 
form 

Ai  A  ...A  Am  D  N, 

where  all  the  A^’s  were  known  to  be  true.  After  a  new  design  decision  was  made,  EL  would 
perform  all  possible  one-step  deductions  in  an  attempt  to  either  show  the  design  decisions 
to  date  to  be  inconsistent,  or  else  to  derive  constraints  on  remaining  circuit  parameters. 

4.8.3  MYCIN 

While  primarily  a  backwards  inference  system,  in  the  Mycin  System  (ShortlifFe,  Buchanan, 
et  al  [96,9,17])  it  was  beneficial  to  use  a  form  of  interleaved  forward  and  backwards  inference 
in  certain  cases.  The  backwards  search  of  MYCIN  possessed  state  information  in  what  was 
called  contexts.  This  state  information  was  used  in  conjunction  with  forward  reasoning 
for  two  purposes.  First,  the  preview  mechanism  acted  as  a.  filter  on  rules,  that  is,  if  the 
premise  of  a  rule  could  be  immediately  proven  to  be  false,  the  rule  could  be  eliminated  from 
consideration.  Consider  a  rule  A  A  B  D  C.  Since  large  amounts  of  inference  and  (more 
importantly)  interaction  with  the  user  might  be  involved  in  trying  to  establish  A,  it  was 
critical  to  prune  this  rule  if  B  was  already  known  to  be  false.  The  second  use  of  forward 
reasoning  was  similar  to  using  ramifications.  In  order  to  smooth  the  interaction  with  users, 
it  was  necessary  to  ask  questions  in  a  fairly  constrained  fashion.  One  method  for  doing  so 
was  to  force  a  set  of  questions  to  be  asked  whenever  a  context  was  instantiated.  Antecedent 
rules  were  used  to  see  whether  answers  to  questions  had  already  uniquely  specified  the 
answer  to  other  questions,  and  thus,  obviate  the  need  to  ask  the  certain  questions.  The 
forward  reasoning  was  in  the  form  of  one-step  deductions,  as  in  EL,  and  results  of  one-step 
deductions  could  cause  other  one-step  deductions  to  be  triggered. 

4.8.4  Stefik’s  MOLGEN 

Mark  Stefik’s  MOLGEN  planner  [93,92]  worked  in  the  domain  of  genetics  experiment 
design.  His  basic  notion  was  that  in  this  domain,  a  hierarchical  approach  to  experiment 
design  would  require  very  little  backtracking  if  only  the  constraints  created  at  any  point  are 
immediately  propagated  to  the  rest  of  the  plan.  At  every  point  it  was  preferred  to  either 
make  decisions  for  which  only  a  single  choice  is  possible  or  propagate  a  constraint  rather 
than  making  guesses  that  might  have  to  be  retracted.  Stefik  dubbed  such  search  control 
the  least  commitment  cycle. 
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Constraint  satisfaction  and  propagation  in  MOLGEN  was  done  by  LISP  procedures  \ 

for  each  constraint  or  constraint  type.  Each  operator  introduced  into  the  plan  could  also  < 

( 

introduce  a  set  of  constraints,  and  had  the  ability  to  regress  or  progress  constraints  through  « 

its  action.  One  can  view  Stefik’s  constraints  as  a  special  class  of  prerequisite,  a  prerequisite  j 

that  one  should  satisfy  via  as  a  side  effect  of  other  actions  or  variable  choices  in  the  plan 
rather  than  try  to  satisfy  via  introduction  of  special  actions  for  this  purpose.  j 

4.8.5  King’s  QUIST  j 

Jonathan  King ’s  Ph.D.  research  [44]  was  embodied  in  a  system  called  QUIST  for  “query  ' 

optimization  by  semantic  reasoning.”  The  notion  was  that  besides  standard  syntactic  trans¬ 
formations  on  database  queries,  semantic  restrictions  upon  the  database  could  be  used  to 
reformulate  a  query  as  a  less  expensive  query.  For  example,  if  all  ships  above  a  certain 
tonnage  is  known  to  be  supertankers,  and  it  is  cheaper  to  find  all  supertankers  than  all 
ships,  this  fact  could  be  used  in  answering  queries  about  ships.  King  defined  the  notion  of 
semantic  equivalence  transformations ,  transformations  of  a  query  Q  to  a  query  Q'  such  that 
are  not  logically  equivalent,  but  for  every  permitted  interpretation,  the  two  queries  have  the 
same  set  of  answers. 

In  order  to  generate  semantically  equivalent  queries,  QuiST  defined  a  set  of  transfor¬ 
mations  allowing  the  system  to  do  one-step  deductions  and  manipulate  conjunctions  and 
disjunctions.  The  process  produced  new,  semantically  equivalent  queries  that  could  be 
cheaper  to  solve  by  the  same  mechanisms  outlined  in  Section  3.6.  Estimates  of  costs  of 
queries  were  established  by  working  with  well-defined  models  of  the  database  and  queries. 

It  considered  only  the  class  of  restrict-join-project  queries,  an  incomplete  but  very  useful 
class  of  relational  database  queries,  and  used  a  simple  model  of  access  and  storage  (based 
in  the  work  of  Blasgen  and  Eswaren  [3]  at  IBM)  to  measure  the  cost  of  processing  a  given 
query. 

4.8.6  Kohli  and  Minker 

In  [45],  Madhur  Kohli  and  Jack  Minker  proposed  controlling  backwards  searcli  by  using 
integrity  constraints.16  Their  paper  dealt  with  logic  programs  written  in  function-free  or¬ 
dered  Horn  clauses,  and  it  assumed  the  presence  of  integrity  constraints  on  the  database, 
also  expressed  as  Horn  clauses.  Like  Ordered  Resolution  on  IIOH-clauses  (See  Section  2.1), 
and  in  contrast  to  Prolog,  the  search  strategy  is  not  restricted  to  be  depth-first.  The  paper 
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proposed  checking  every  goal  generated  against  all  integrity  constraints,  the  pruning  of  in¬ 
consistent  goals,  and  notions  similar  to  generators  and  filters.17  Kohli  and  Minker  cached 
ramifications  they  derived  (“implicit  integrity  constraints”  in  their  terminology),  and  be¬ 
cause  only  Horn  clauses  were  considered,  all  ramifications  are  inherited  from  parent  goals 
(See  Section  4.7). 

4.8.7  Chakravarathy,  et  al 

In  [12,14,13]  U.  S.  Chakravarthy,  Jack  Minker  and  their  colleagues  have  described  the 
extension  of  Semantic  Query  Optimization  to  deal  with  deductive  databases.18  In  Chakra- 
varthy’s  system,  it  is  assumed  that  there  will  be  many,  many  queries  and  a  fairly  limited 
number  of  integrity  constraints.  Rather  than  trying  to  optimize  each  query  as  King  does, 
Chakravarthy’s  system  stores  appropriate  supersumptions  with  each  intensional  and  exten- 
sional  relation.  When  a  query  is  presented  to  the  system,  it  may  then  quickly  see  which 
of  its  cached  supersumptions  are  appropriate.  Given  the  huge  potential  speedup  on  large 
databases,  it  is  worth  the  overhead  of  deriving  and  storing  supersumptions  with  each  rela¬ 
tion.  Note  that  it  is  usually  combinations  of  relations  in  a  query  that  make  supersumptions 
useful.  To  cache  all  potentially  useful  supersumptions  for  each  relation  is  combinatorically 
explosive,  but  given  Chakravarthy’s  assumptions  about  the  number  of  queries  versus  the 
number  of  relations  and  integrity  constraints,  it  is  reasonable  to  do  so. 

Chakravarthy  starts  with  integrity  constraints  and  intensional  and  extensional  database 
definitions  stated  in  clausal  form.  To  find  potentially  useful  supersumptions,  Chakravarthy 
looks  at  partial  subsumptions  of  definitions  of  relations,  that  is,  definitions  that  are  sub¬ 
sumed  by  a  subset  of  literals  in  an  integrity  constraint.  Using  subsumption  rather  than 
simple  unification  forces  the  supersumption  to  hold  for  all  values  of  all  variables.  The  treat¬ 
ment  is  quite  similar  to  the  Vrps  Procedure  of  this  thesis  (See  Section  4.6,  and  the  name 
Prps  “resolution  with  partial  subsumption”  was  chosen  to  point  out  the  similarity  with 
Chakravarthy’s  approach  to  database  optimization. 

A  number  of  other  researchers  have  consider  semantic  query  optimization  on  databases, 
including  Hammer  and  Zdonik  [35],  Xu  [107]  and  Jarke  et  al  [43]. 

1 '  It  is  not  clear  to  the  present  author  that  the  mechanism  proposed  for  generators  is  guaranteed  to  produce 
only  correct  answers. 

lf>As  described  in  Reiter  [75],  databases  are  divided  into  extensional  and  intensional  relations,  where 
extensional  relations  are  relations  that  can  be  looked  up  in  the  database,  whereas  intensional  relations  must 
be  reduced  to  combinations  of  intensional  relations.  The  reduction  of  intensional  relations  takes  place  via 
rules  from  which  the  extensional  definition  of  an  intensional  relation  may  be  deduced  —  hence  the  name 
deductive  database. 
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4.8.8  Lee,  et  al. 

Theorem  4.2,  Section  4.5  shows  that  although  resolution  in  not  deductively  complete,  it  can 
deduce  a  clause  that  subsumes  any  clause  implied  by  the  base  set.  Although  the  proof  was 
done  independently,  it  turns  out  that  a  virtually  identical  result  was  published  exactly  20 
years  ago  in  Richard  C.  T.  Lee’s  PhD  thesis  [50]  and  extended  to  linear  resolution  (of  which 
set  of  support  is  a  special  case  in  this  case)  by  Minicozzi  and  Reiter  [70]  in  1972.  Thus,  it  is 
only  fair  to  view  Theorem  4.2  and  Theorem  4.6.3  as  very  minor  variations  on  these  earlier 
results.  Thanks  to  Richard  Waldinger  and  Mark  Stickel  for  pointing  out  this  research  to 
me. 


4.9  Summary 

This  chapter  has  explored  deduction  of  ramifications  from  a  conjunctive  goal  G  =  Gi  A  . . .  A 
Gm,  where  the  G,  are  atomic  formulas.  Two  resolution-based  procedures,  Vrgc  and  Vrps 
are  shown  to  be  complete  for  this  problem,  though  not  in  the  usual  sense  of  generating 
all  possible  ramifications.  Instead  they  are  uinplete  in  that  for  any  ramification  N,  both 
of  these  procedures  will  generate  a  ramification  that  is  at  least  as  “strong”  as  N.  More 
precisely,  for  every  clause  N,  in  a  conjunct  normal-form  of  N,  Vrgc  and  Vrps  can  generate 
a  clause  N'  that  subsumes  N,.  Vrps  is  superior  to  Vrgc  in  that  it  is  completely  within  the 
framework  of  resolution  on  the  world  model  W,  that  is,  Vrps  is  a  restriction  on  resolutions 
on  base  set  W  that  disallows  resolutions  unrelated  to  the  goal  G  at  hand,  but  still  allows 
all  needed  ramifications  to  be  found. 

In  database  retrievals,  a  single  goal  G  is  given  and  variable  bindings  for  x  must  be  found 
such  that  G  holds.  As  discussed  in  Chapter  2,  the  problems  considered  here  involve  goal 
reduction,  that  is,  reduction  of  the  orignal  goal  to  other  goals  via  backwards  reasoning. 
Because  ramifications  could  be  deduced  from  any  such  goal,  it  is  important  to  consider 
whether  the  search  for  ramifications  of  one  goal  can  be  used  in  the  search  for  ramifications 
of  another  goal.  There  are  two  forms  of  sharing  of  search  explored  in  this  chapter:  ( 1 ) 
Inheritance  of  ramifications  from  parent  goals,  and  (2)  Caching  of  ramifications  along  with 
the  goal  conjuncts  from  which  they  were  derived.  Section  4.7  derived  results  for  what 
ramifications  are  and  are  not  inherited  from  parents.  A  simple  modification  of  Vrgc  for 
caching  ramifications  and  the  conjuncts  on  which  they  are  based  is  presented  in  Section  1.5.3. 
Vrps,  on  the  other  hand,  handles  the  caching  of  ramifications  in  a  much  more  natural 
fashion,  at  all  times  recording  the  goal  conjuncts  (and  only  those  goal  conjuncts)  on  which 
a  deduction  of  a.  ramification  is  based. 


Chapter  5 

Conclusion 


Synthesis  problems  constitute  a  major  class  of  problems  encountered  in  many  fields.  Robot 
planning,,  circuit  design,  automatic  generation  of  diagnostic  tests,  program  synthesis,  and 
automatic  theorem  proving  are  among  the  synthesis  problems  commonly  encountered  in  the 
AI  literature.  Automatic  design  synthesis  has  been  of  interest  since  the  very  beginnings  of 
AI.  Deductive  approaches  to  synthesis  problems,  that  is,  constructing  a  design  as  part  of 
the  proof  of  a  theorem,  date  from  the  work  of  Green  [32]  and  Waldinger  and  Lee  [99]  in  the 
late  1960’s. 

The  current  research  continues  in  the  tradition  of  deductive  design  synthesis.  In  previous 
deductive  syntl  esis,  design  has  been  a  process  of  backwards  reasoning  from  a  goal  formula, 
representing  the  design  as  a  term  in  the  logic.  In  reasoning  only  backwards  during  the  design 
process,  such  systems  have  not  considered  interactions  of  various  parts  of  the  design  already 
specified  with  each  other  and  with  the  remaining  subgoals.  In  addition,  representation  of 
the  design  as  a  term,  that  is,  a  composition  of  functions,  has  made  it  unnatural  to  reason 
about  the  design,  and  has  limited  the  set  of  design  decisions  that  can  easily  be  expressed. 

This  thesis  has  two  main  themes.  First,  for  reasonable  behavior  over  a  wide  spectrum 
of  goals,  the  design  process  should  be  bidirectional: ;  one  should  reason  backwards  from  the 
goal  (goal  reduction)  and  forward  from  the  goal  and  any  design  decisions  that  have  been 
made  (consistency  checking  and/or  supersumption).  Second,  designs  should  be  represented 
as  formulas  rather  than  as  terms.  By  doing  so  one  gains  expressiveness  in  representing 
design  decisions  and  the  ability  to  reason  directly  about  the  design. 

The  main  points  of  this  thesis  are  summarized  in  the  following  sections.  The  first 
presents  the  main  contributions  of  this  thesis.  The  second  section  summarizes  its  main 
limitations,  and  the  third  suggests  directions  for  future  work  in  this  area  and  improvements 
of  this  thesis. 
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5.1  Summary  of  Contributions 

5.1.1  A  Framework  for  Design 

Chapter  2  defined  a  residue ,  a  new  definition  for  specification  of  a  design  object.  In  this 
formulation,  both  complete  and  incomplete  designs  are  represented  as  single  formula  of 
first*order  predicate  calculus.  A  formula  is  a  residue,  i.e.,  a  legal  design,  if  it  (1)  logically 
implies  the  goal  specification,  (2)  is  consistent  with  the  set  of  axioms  describing  the  world, 
and  (3)  is  a  conjunction  of  formulas,  each  of  which  can  be  assumed  to  be  achievable  in  the 
world  being  modeled. 

The  chief  advantage  of  Residue’s  approach  is  that  the  system  can  use  the  entire  set  of 
relations  of  the  logic  to  be  used  to  express  constraints  on  the  design.  The  ability  to  express 
all  desired  design  constraints  is  crucial  during  the  design  process,  when  imposition  of  an  un¬ 
necessarily  strong  constraint  can  lead  to  needless  backtracking.  In  contrast,  representation 
of  the  design  as  a  term  limits  the  expressible  design  constraints  to  those  for  which  there  is 
a  single  pre-existing  function. 

Consistency  checking  corresponds  to  seeing  that  all'  the  constraints  imposed  upon  the 
design  can  be  realized  at  once,  an  integral  part  of  complicated  design  problems.  Without  the 
ability  to  check  consistency  of  a  design,  one  may  not  partially  specify  a  component  on  which 
other,  possibly  inconsistent,  constraints  will  later  be  imposed.  For  a  design  represented  as  a 
formula,  this  notion  corresponds  to  consistency  of  the  design  formula  with  the  set  of  axioms 
describing  the  world.  In  contrast,  checking  consistency  of  a  design  expressed  as  a.  single 
term  is  an  ad  hoc  process.  There  is  no  general  way  to  use  a  set  of  axioms  describing  the 
world  to  check  whether  the  object  denoted  by  a  given  term  is  consistent  with  that  set  of 
axioms.  As  a  result,  single-term  approaches  have  usually  been  limited  to  design  problems 
in  which  consistency  checking  is  not  needed,  a  severe  limitation. 

5.1.2  Procedure  for  Design  Synthesis 

Chapter  2  also  described  two  procedures,  Resolution  Residue  and  Ordered  Residue,  for 
generation  of  residues.  Both  of  these  procedures  work  by  reducing  a  goal  specification  via 
backwards  reasoning  to  a  set  of  primitively  achievable  specifications.  Resolution  Residue 
uses  binary  resolution  for  its  backwards  inference,  and  Ordered  Residue  uses  an  ordered 
resolution  on  Horn  clauses. 

For  both  residue  procedures,  appropriate  completeness  results  were  proven.  The  com¬ 
pleteness  results  show  that  although  not  every  residue  can  be  generated,  for  every  residue 
not  generated,  a  residue  at  least  as  general  will  be  generated. 
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5.1.3  Supersumption 

This  research  has  dealt  with  the  derivation  and  use  of  constraints  derived  from  a  goal  (or 
subgoal)  via  forward  reasoning.  Such  constraints  are  called  ramifications  and  the  process 
of  using  those  constraints  is  called  supersumption.  Supersumption  captures  the  notion  that 
one  should  not  only  consider  the  consistency  of  a  partially  specified  solution,  but  should  be 
able  to  make  use  of  any  conditions  necessary  for  consistency. 

The  Residue  Procedure  allows  pruning  of  inconsistent  designs  during  the  design  proce¬ 
dure.  Supersumption  is  a  way  to  (1)  avoid  generating  some  of  those  inconsistent  designs, 
and  (2)  avoid  some  of  the  overhead  of  consistency  checking  over  a  large  set  of  designs. 

5.1.4  Procedure  for  Finding  Ramifications 

Chapter  4  presented  two  procedures  for  finding  ramifications,  constraints  necessary  for  con¬ 
sistency  of  a  set  of  design  constraints  and  any  remaining  subgoal.  Both  of  these  procedures 
find  ramifications  as  a  part  of  of  the  checking  consistency  process. 

The  first  procedure  Vrgc  uses  resolution  as  a  deduction  procedure  rather  than  its  usual 
use  as  a  refutation  procedure.  Although  resolution  is  not  deductively  complete,  the  com¬ 
pleteness  results  of  this  chapter  show  that  for  every  ramification  not  derivable  by  resolution, 
a  ramification  that  is  at  least  as  strong  is  derivable.  In  addition,  Vrgc  uses  conjuncts  of 
the  goal  as  a  set  of  support  from  which  to  perform  resolutions.  This  has  the  important 
propel  ty  that  any  ramification  that  is  derivable  is  derivable  without  resolving  random  facts 
about  the  world  with  each  other  —  all  ramifications  are  derivable  from  goal  conjuncts. 

The  second  of  the  two  procedures,  Vrps ,  has  all  the  above  properties  of  the  Vrgc  with 
one  major  difference:  Vrps  derives  ramifications  in  the  form  of  facts  that  may  be  directly 
added  to  the  database  of  axioms  about  the  world.  Such  facts  record  the  ramification  derived, 
and  the  goal  conjuncts  that  were  needed  to  derive  that  ramification.  By  recording  such  facts, 
other  goals  with  conjuncts  in  common  (modulo  variable  names)  may  use  the  result  of  the 
previous  derivation  of  ramifications. 

5.2  Main  Limitations  of  the  Approach 

5.2.1  Assumable  Formulas  must  be  Atomic 

Chapter  2  makes  the  assumption  that  the  design  will  consist  of  a  set  of  primitively  achievable 
atomic  formulas.  Disjunctions  and  conditions  are  thus  excluded  from  designs.  This  assump¬ 
tion  appears  both  in  the  completeness  theorems  of  Chapter  2  as  well  as  in  the  Resolution 
Residue  and  Ordered  Residue  procedures.  In  these  procedures,  designs  arc  represented  as 


5.2.  MAIN  LIMITATIONS  OF  THE  APPROACH 


103 


the  negation  of  a  single  clause,  that  is,  a  design  is  a  conjunction  of  literals.  To  represent  a 
design  containing  disjunctions,  multiple  clauses  would  have  to  be  used  for  the  design. 

The  severity  of  this  restriction  is  not  entirely  clear.  At  first  glance  it  would  appear 
that  conditional  plans  would  be  impossible  to  build  as  residues,  but  in  fact  they  have  been 
synthesized.  To  do  so,  it  was  necessary  only  to  represent  actions  as  mappings  to  one  of 
several  possible  succeeding  states  based  upon  the  outcome  of  that  action.  It  is  as  yet 
unknown  whether  procedures  can  be  found  for  generating  non-atomic  residues,  and  under 
what  circumstances  it  is  desirable  to  generate  designs  out  of  such  components. 


5.2.2  Design  and  Subdesigns  Have  No  Name 

The  Residue  Procedure,  in  contrast  to  the  single-term  approach,  does  not  reify  the  design; 
there  is  no  perfectly  natural  way  to  refer  to  the  design  as  a  whole,  nor  is  there  a  way  to 
refer  to  portions  of  the  design.  Instead,  t!/  -  t  of  design  decisions  comprising  the  design 
are  stated  as  facts  describing  the  entire  world.  For  example,  it  is  awkward  to  say  that 
the  cost  of  the  design  must  be  less  than  $1.00.  One  can  probably  avoid  this  problem  by 
introducing  a  relation  on  design  components  and  designs  (or  subdesigns)  such  that  the 
relation  holds  for  every  component  of  a  particular  design  (or  subdesign),  but  to  date,  use 
of  such  axiomatizations  has  not  been  explored. 


5.2.3  Rederivation  of  Cached  Deductions 

A  good  deal  of  the  power  of  supersumption  comes  from  its  caching  of  ramifications;  the 
cached  ramifications  can  be  used  to  filter  (prune)  other  design  candidates  without  the 
ramification  being  rederived.  In  this  thesis,  as  in  many  other  systems,  the  caching  of  the 
result  of  a  series  of  deductions  does  not  mean  that  those  deductions  will  not  be  performed 
again  as  forward  reasoning  from  another  goal. 

In  systems  for  which  it  is  important  to  preserve  completeness,  it  is  a  difficult,  problem 
knowing  when  old  results  can  be  reused  without  further  exploration  of  the  path  on  which 
they  were  found.  The  old  path  might  not  have  been  completely  explored,  or  new  facts 
might  enable  derivation  of  new  results  that  were  not  possible  when  the  path  was  previously 
explored. 
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5.3  Further  Work 

5.3.1  Control  Heuristics  for  Residue 

Besides  the  usual  search  control  issues  in  searching  a  search  space,  Residue  involves  both 
forward  reasoning  (consistency  checking)  and  backwards  reasoning  (goal  reduction).  Al¬ 
though  it  is  not  logically  necessary  to  check  consistency  of  a  design  until  it  is  complete,  one 
of  the  major  motivations  for  Residue  is  the  ability  to  quickly  prune  inconsistent  designs. 
To  date,  there  has  not  been  work  on  when  consistency  checking  is  heuristically  appropriate 
during  the  design  process.  Similarly,  there  is  little  known  on  how  much  effort  to  spend 
checking  consistency  at  any  given  point.  Such  a  decision  will  depend  upon  such  factors 
as  the  branchiness  of  an  upcoming  decision  point,  how  close  to  being  overconstrained  the 
design  currently  is,  and  the  extent  to  which  the  upcoming  decision  is  perceived  to  further 
constrain  the  set  of  design  candidates. 

5.3.2  Cost  of  Solving  a  Problem 

In  order  to  know  when  to  use  a  given  supersumption,  one  must  be  able  to  estimate  the  cost 
of  finding  a  solution  to  a  given  goal.  To  date,  we  have  good  models  only  under  strong  sets 
of  assumptions.  In  particular,  we  are  good  at  estimating  the  cost  of  lookups  on  conjunctive 
queries  in  extensional  databases.  For  subgoals  whose  solutions  involve  backwards  reasoning, 
there  is  still  very  little  work. 

5.3.3  Control  Heuristics  for  Finding  Ramifications 

As  was  discussed  in  Chapter  4,  ramifications  can  be  found  as  part  of  the  process  of  consis¬ 
tency  checking.  If  one  checks  consistency  of  partially  completed  designs,  there  is  a  tradeoff 
of  forward  and  backwards  reasoning  (as  mentioned  above  in  Section  5.3.1).  A  similar  trade¬ 
off  exists  when  forward  reasoning  is  used  to  find  ramifications  for  supersumption;  good 
heuristics  to  decide  when  to  do  such  forward  reasoning  and  what  forward  reasoning  are 
necessary.  There  may  be  cases  where  it  would  pay  off  to  look  for  ramifications,  whereas 
consistency  checking  would  not  be  called  for. 

5.3.4  Probable  Constraints 

The  above  research  has  involved  finding  ramifications  of  a  goal.  As  was  mentioned  in 
Chapter  3,  supersumption  can  be  used  with  other  constraints  as  well.  For  problems  in  which 
not  all  solutions  to  a  problem  are  needed,  use  of  probable  constraints,  that  is.  constraints 
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derived  by  plausible  reasoning  from  a  goal,  is  at  least  as  useful  as  supersumption  using  only 
ramifications. 

The  general  use  of  probable  constraints  is  equivalent  to  heuristic  search.  Finding  prob¬ 
able  constraints  by  plausible  reasoning  from  goals  (and  subgoals)  and  the  addition  of  such 
constraints  to  the  goal  at  hand  is  a  very  specific  form  of  heuristic  search  guidance.  It  has  the 
same  control  problems  as  supersumption  with  ramifications,  and  heuristics  for  search  con¬ 
trol  applicable  to  supersumption  with  ramifications  should  be  applicable  to  supersumption 
with  probable  constraints  as  well. 
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